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Description 

TECHNICAL FIELD 

s [0001] The present invention relates generally to methods and compositions for analyzing nucleic acid molecules, 
and more specifically to tags which may be utilized in a wide variety of nucleic acid reactions, wherein separation of 
nucleic acid molecules based on size is required. 

BACKGROUND OF THE INVENTION 

10 

[0002] Detection and analysis of nucleic acid molecules are among the most important techniques in biology. Such 
techniques are at the heart of molecular biology and play a rapidly expanding role in the rest of biology. 
[0003] Generally, one type of analysis of nucleic acid reactions involves separation of nucleic acid molecules based 
on length. For example, one widely used technique, polymerase chain reaction (PCR) (see, U.S. Patent Nos. 4,683,1 95, 
>5 4,683,202, and 4,800,159) has become a widely utilized technique to both identify sequences present in a sample and 
to synthesize DNA molecules for further manipulation. 

[0004] Briefly, in PCR, DNA sequences are amplified by enzymatic reaction that synthesizes new DNA strands in 

oiti ; or a goot:i --trio or Lnoar fashion. Following amplification, the DNA sequences must be detected and identified. 
Because of non-specific amplifications, which would otherwise confuse analysis, or the need for purity, the PCR reaction 

?0 products are generally subjected to separation prior to detection. Separation based on the size (i.e., length) of the 
products yields the most useful information. The method giving the highest resolution of nucleic acid molecules is 
eiectrophoretic separation. In this method, each individual PCR reaction is applied to an appropriate gel and subjected 
to a voltage potential. The number of samples that can be processed is limited by the number of wells in the got. On 
most gel apparatus, from approximately 10 to 64 samples can be separated in a single gel. Thus, processing large 

25 numbers of samples is both labor and material intensive. 

[0005] Eiectrophoretic separation must be coupled with some detection system in order to obtain data. Detoction 
systems of nucleic acids commonly, and almost exclusively, utilize an intercalating dye or radioactive label, and less 
frequently, a non-radioactive label. Intercalating dyes, such as ethidium bromide, are simple to use. The dye is included 
in Ifte gel matrix during electrophoresis or, following electrophoresis, the gel is soaked in a dye-containing soiution. 

Jo The dye can be directly visualized in some cases, but more often, and for ethidium bromide In particular, is excited by 
tight {e.g., UV) to fluoresce, in spite of this apparent ease of use, such dyes have some notable disadvantages. First, 
the dyes are insensitive and there must be a large mass amount of nucleic acid molecules in order to visualize the 
products. Second, the dyes are typically mutagenic or carcinogenic. 

[0006] A more sensitive detection technique than dyes uses a radioactive (or nonradioactive) label. Typically, either 
3S a radiolabeled nucleotide or a radiolabeled primer is included in the PCR reaction. Following separation, the radioJabe) 
is "visualized" by autoradiography. Although more sensitive, the detection suffers from film limitations, such as reci- 
procity failure ano non-linearity. These limitations can be overcome by detecting the label by phosphor image analysis 
However, radiolabels have safety requirements, increasing resource utilization and necessitating specialized equip- 
ment and personnel training. For such reasons, the use of nonradioactive labels has been increasing in popularity. In 
40 such systems, nucleotides contain a label, such as afluorophore, biotin ordigoxm, which can be detected by an antibody 
or other molecule {e.g., other member of a ligand pair) that is labeled with an enzyme reactive with a chromogenic 
substrate. These systems do not have the safety concerns a3 described above but use components that are often 
labile and may yield nonspecific reactions, resulting in high background (i.e., low signal-to-noise ratio). 
[0007] WO-A-9 504 160 discloses a tag reagent comprising an analyte moiety having two analyte residues linked to 
4 $ a tag moiety having a reporter group. A library of these reagents is used to sequence nucleic acids. 

[0008] Jacobscn ef al. (G.A.T.A., Vol. 8, p. 223-29, 1 991 ) discloses a tag reagent comprising an inorganic tag. 
[0009] WO-A-9 528 640 discloses a tag reagent and assay system wherein relevant nucleotide groups and corre- 
sponding reporter groups are added either side of a single linker. 

[0010] US-A-5 324 231 discloses a method of detecting RFLP using specific primers labelled with non-radioactive 
so labeis, such as chromophores. 

[0011] The present invention provides novel compositions and methods which may be utilized in a wide variety of 
nucleic acid reactions, and further provides other related advantages. 

SUMMARY OF THE INVENTION 

S3 

[0012] Briefly stated, the present invention provides compositions and methods which may be utilized in a wide 
variety of ligand pair reactions wherein separation of molecules of interest, such as nucleic acid molecules, based on 
size is required. Representative examples of methods which may be enhanced given the disclosure provided herein 
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include PCR, differential display, RNA fingerprinting, PCR-SSCR oligo litations assays, nuclease digestion methods 
(e.g., exo- and endo- nuclease based assays), and dideoxy fingerprinting The methods described herein may be utilized 

n a wide array cf fields, including, for example, in the development of clinical or recearch based diagnostics, the 
determination of polymorphisms, and the development of genetic maps. 

[0013] Within one aspect of the present invention, methods are provided for determining the identity of a nucleic acid 
molecule, comprising the stops of (a) generating tagged nucleic ac;d molecules from one or more selected target 
nucleic acid molecules, wherein a tag is correlative with a particular nucleic acid fragment and detectable by non- 
fluorescent spectrometry or potentiometry (b) separating the tagged fragments by size, (c) cleaving the tags from the 
tagged fragments, and (d) detecting tags by non-fluorescent spectrometry or potentiometry, and therefrom determining 
the identity of the nucleic acid molecules. 

[0014] Within a related aspect of the invention, methods are provided for detecting a selected nucleic acid molecule, 
comprising the steps of (a) combining tagged nucleic acid probes with target nucleic acid molecules under conditions 
and for a time sufficient to permit hybridization of a tagged nucleic acid probe to a complementary selected target 
nucleic acid sequence, wherein a tagged nucleic acid probe is detectable by non-fluorescent spectrometry or potenti- 
ometry, (b) altering the size of hybridized tagged probes, unhybridized probes or target molecules, or the probe.target 
hybrids, (c) separating the tagged probes by size, (d) cleaving tags from the tagged probes, and (e) detecting the tags 
by non-fluorescent spectrometry or potentiometry, and therefrom detecting the selected nucleic acid molecule. 
[0015] Within further aspects methods are provided for genotypmg a selected organism, comprising the steps of (a) 
generating tagged nucleic acid molecules from a selected target molecule, wherein a tag is correlative with a particular 
fragment and may be detected by non-fluorescent spectrometry or potentiometry, (b) separating the tagged molecules 
by sequential length, (c) cleaving the tag from the tagged molecule, and (d) detecting the tag by non-fluorescent spec- 
trometry or potentiometry, and therefrom determining the genotype of the organism. 

[0016] Within another aspect, methods are provided for genotyping a selected organism, comprising the steps of (a) 
combining a tagged nucleic acid molecule with a selected target molecule under conditions and for a time sufficient to 
permit hybridization of the tagged molecule to the target molecule, wherein a tag is correlative with a particular fragment 
and may be detected by non-fluorescent spectrometry or potentiometry, (b) separating the tagged fragments by se- 
quential length, (c) cleaving the tag from the tagged fragment, and (d) detecting the tag by non-fluorescent spectrometry 
or potentiometry, and therefrom determining the genotype of the organism. 

[0017] Within the context of the present invention it should bo understood that "biological samples" include not only 
samples obtained from living organisms [e.g., mammals, fish, bacteria, parasites, viruses, fungi and the like) or from 
the environment {e.g., air, waler or solid samples), but b.clogical materials which may be artificially or synthetically 
produced (o.g., phage libraries, organic molecule libraries, pools of genomic clones, cDNA clones, RNA clones, or the 
like). Representative examples of biological samples include biological fluids {e.g., blood, semen, cerebral spinal fluid, 
urine), biological cells (e.g., stem cells, B or T cells, liver cells, fibroblasts and the like), and biological tissues. Finally, 
representative examples of organisms that may be genotyped include virtually any unicellular or multicellular organism, 
such as warmblooded animals, mammals or vertebrates (e.g., humans, chimps, macaques, horses, cows, pigs, sheep, 
dogs, cats, rats and mice, as well as cells from any of these), bacteria, parasites, viruses, fungi and plants. 
[0018] Within various embodiments of the above-desenbed methods, (he nucleic acid probes and or molecules of 
the present invention may be generated by, for example, a ligation, cleavage or extension (e.g., PCR) reaction. Within 
other related aspects the nucleic acid probes or molecules may be tagged by non-3' tagged oligonucleotide primers 
{e.g., 5'-tagged oligonucleotide primers) or dideoxynucleotide terminators. 

[0019] Within other embodiments of the invention, 4, 5. 10, 15. 20, 25, 30, 35. 40, 45, 50. 60 . 70, SO, 90, 100, 200, 
250, 300, 350, 400, 450, or greater than 500 different and unique tagged molecules may be utilized within a given 
reaction simultaneously, wherein each tag is unique for a selected nucleic acid molecule or fragement, or probe, and 
may be separately identifed. 

[0020] Within further embodiments of the invention, the tag(s) may be detected by fluorometry, mass spectrometry, 
infrared spectrometry, ultraviolet spectrometry, or, potentiostatic amperometry (e.g., utilizing coulometric or ampero- 
metric detectors). Representative examples of suitable spectrometry techniques include time-of-flight mass spectrom- 
etry, quadrupole mass spectrometry, magnetic sector mass spectrometry and electric sector mass spectrometry. Spe- 
cific embodiments of such techniques include ion-trap mass spectrometry, eiectrospray ionization mass spectrometry, 
ion-spray mass spectrometry, liquid ionization mass spectrometry, atmospheric pressure ionization mass spectrometry, 
electron ionization mass spectrometry, fast atom bombard ionization mass spectrometry, MALDI mass spectrometry, 
photo-ionization time-of-flight mass spectrometry, laser droplet mass spectrometry, MALDI-TOF mass spectrometry, 
APCI mass spectrometry, nano-spray mass spectrometry, nebulised spray ionization mass spectrometry, chemical 
ionization mass spectrometry, resonance ionization mass spectrometry, secondary ionization mass spectrometry and 
thermospray mass spectrometry. 

[0021] Within yet other embodiments of the invention, the target molecules, hybridized tagged probes, unhybridized 
probes or target molecules, probe;target hybrids, or tagged nucleic acid probes or molecules may be separated from 
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other molecules utilizing methods which discriminate between the size of molecules (either actual linear size, or three- 
dimensional size). Representative examples of such methods include gel electrophoresis, capillary electrophoresis, 
micro-channel electrophoresis, HPLC, size exclusion chromatography, filtration, polyacrylamide gel electrophoresis, 
liquid chromatography, reverse size exclusion chromatography, ion-exchange chromatography, reverse phase liquid 

5 chromatography, pulsed-fieid electrophoresis, field-inversion electrophoresis, dialysis, and fluorescence-activated liq- 
uid droplet sorting. Alternatively, the target molecules, hybridized tagged probes, unhybridtzed probes or target mole- 
cules, probe:target hybrids, or tagged nucleic acid probes or molecules may be bound to a solid support (e.g., hollow 
fibers (Amicon Corporation, Danvors, Mass.), beads (Polysciences, Warrington, Pa.), magnetic beads (Robbin Scien- 
tific, Mountain View, Calif.), plates, dishes and flasks (Coming Glass Works, Coming, N.Y.), meshes (Becton Dickinson, 

to Mountain View, Calif.), screens and solid fibers (see Edelman et al., U.S. Patent No. 3,843,324; see also Kuroda etyaL, 
U.S. Patent No. 4,416,777), membranes (Millipore Corp., Bedford, Mass.), and dipsticks). If the first or second member, 
or exposed nucleic acids are bound to a solid support, within certain embodiments of the invention the methods dis- 
closed herein may further comprise the step of washing the solid support of unbound material. 
[0022] Within other embodiments, the tagged nucleic acid molecules or probes may be cleaved by a methods such 

ts as chemical, oxidation, reduction, acid-labile., base labile, enzymatic, electrochemical, heat and photolabile methods. 
Within further embodiments, the steps of separating, cleaving and detecting may be performed in a continuous manner, 
for example, on a single device which may be automated. 

[0023] Within certain embodiments of the invention, :he size of the hybridized tagged probes, unhybridized probes 
or target molecules, or prcbe:target hybrids are altered by a method selected from ;he group consisting of polymerase 
20 extension, ligation, exonuclease digestion, endonuclease digestion, restriction enzyme digestion, site-specific recom- 
binase digestion, ligation, mismatch specific nuclease digestion, methylation-specific nuclease digestion, covalent at- 
tachment of probe to target and hybridization. 

[0024] The methods an compositions described herein may be utilized in a wide variety of applications, including for 
example, identifying PCR amplicons. RNA fingerprinting, differential display, single-strand conformation polymorphism 
detection, dideoxytingerprinting. restriction maps -a.'-u rostiiction fragment iengtn polymorphisms, ON A fingerprinting, 
genotyping, mutation detection, oligonucleotide ligation assay, sequence specific amplifications, for diagnostics, fo- 
ronsics, identification, developmental biology, biology, molecular medicine, toxicology, animal breeding, 
[0025] Those and other aspects of the present invention will become evident upon reference to the following detailed 
description and attached drawings. In addition, various references are set forth below which describe in more detail 
so certain procedures or compositions [e.g.. plasm id 3. etc.). and ire therefore incorporated by reference in their entirety. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0026] Figure 1 depicts the flowchart for the synthesis of pentafluorcphenyl esters of chemically cleavable mass 
os spectroscopy tags, to liberate tags with carboxyl amide termini. 

[0027] Figure 2 depicts the flowchart for the synthesis of pentafluorcphenyl esters of chemically cleavable mass 
spectroscopy tags, to liberate tags with carboxyl acid termini. 

[0028] Figures 3-6 and 8 depict the flowchart for the synthesis of tetrafluorophenyl esters of a set of 36 photochem- 
ically cleavable mass spectroscopy tags. 
40 [0029] Figure 7 depicts the flowchart for the synthesis of a set of 36 amine-terminated photochemically cleavable 
mass spectroscopy tags. 

[0030] Figure 9 depicts the synthesis of 36 photochemically cleavable mass spectroscopy tagged oligonucleotides 
made from the corresponding set of 36 tetrafluorophenyl esters of photochemically cleavable mass spectroscopy tag 
acids. 

45 [0031] Figure 10 depicts the synthesis of 36 photochemically cleavable mass spectroscopy tagged oligonucleotides 
made from the corresponding set of 36 amine-terminated photochemically cleavable mass spectroscopy tags. 
[0032] Figure 11 illustrates the simultaneous detection of multiple tags by mass spectrometry. 
[0033] Figure 12 shows the mass spectrogram of the alpha-cyano matrix alone. 
[0034] Figure 13 depicts a modularly-constructed tagged nucleic acid fragment. 

50 

DETAILED DESCRIPTION OF THE INVENTION 

[0035] As noted above, the present invention provides compositions and methods for analyzing nucleic acid mole- 
cules, wherein separation of nucleic acid molecules based on size is required. The present methods permit the simul- 
^ 5 taneous detection of molecules of interest, which include nucleic acids and fragments, proteins, peptides, etc. 

[0036] Briefly stated, in one aspect the present invention provides compounds wherein a molecule of interest, or 
precursor thereto, is linked via a labile bond (or labile bonds) to a tag. Thus, compounds of the invention may be viewed 
as having the general formula: 
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T-L-X 

wherein T is the tag component, L is the linker component that either is, or contains, a labile bond, and X is either the 
5 molecule of interest (MOI) component or a functional group component (L h ) through which the MOl may be joined to 
T-L Compounds of the invention may therefore be represented by the more specific general formulas. 

T-L-MOi 

w 

and 

T-L-L h 

IS 

[0037]- For reasons described in detail below, sets of T-L-MOI compounds may be purposely subjected to conditions 
that cause the labile bond(s) to break, thus releasing a tag moiety from the remainder of the compound. The tag moiety 
is then characterized by one or more analytical techniques, to thereby provide direct information about the structure 
of the tag moiety, and (most importantly) indirect information about the identity of the corresponding MOL 
20 [0038] As a simple illustrative example of a representative compound of the invention wherein L is a direct bond, 
reference is made to the following structure (i): 



25 Structure (i) 



30 




id Fragment) 



35 V v ' v v ' 

Tag component Molecule of Interest 

component 

40 In structure (i), T is a nitrogen-containing polycyclic aromatic moiety bonded to a carbonyl group, X is a MOI (and 
specifically a nucleic acid fragment terminating in an amine group), and L is the bond which forms an amide group. 
The amide bond is labile relative to the bonds in T because, as recognized in the art, an amide bond may be chemically 
cleaved (broken) by acid or base conditions which leave the bonds within the tag component unchanged. Thus, a tag 
moiety (i.e., the cleavage product that contains T) may be released as shown below: 

45 



50 
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(Nucleic Acid Fragment) 



acid or base 



OH H 2 N' 



.(Nucleic Acid Fragment) 



Tag Moiety 



Remainder of the Compound 



[0039] However, the linker L may be more than merely a direct bond, as shown in the following illustrative example, 
where reference is made to another representative compound of the invontion having the structure (ii) shown below: 



Structure (if) 






N0 2 











o 



-V" 
T 



1 



^(Nucleic Acid 
Y Fragrant) 
H 



MOT 



it is well-known that compounds having an ort/ro-nitrobenzylamine moiety (see boxed atoms within structure (ii)) are 
photolytically unstable, in that exposure of such compounds to actinic radiation of a specified wavelength will cause 
selective cleavage of the benzylamine bond (see bond denoted with heavy line in structure (ii)). Thus, structure (ii) has 
the same T and MO! groups as structure (i), however the linker group contains multiple atoms and bonds within which 
there is a particularly labile bond. Photolysis of structure (ii) thus releases a tag moiety (T-containing moiety) from the 
remainder of the compound, as shown below. 



EP 0 840 804 B1 




Tag Moiety Remainder of the Compound 



[0040] The invention thus provides compounds which, upon exposure to appropriate cleavage conditions, undergo 
a cleavage reaction so as to release a tag moiety from the remainder of the compound. Compounds of the invention 
may be described in terms of the tag moiety, the MOI (or precursor thereto, L h ), and the labile bond(s) which join the 

two groups together. Alternatively, the compounds of Ihe invention may bo described in terms of the components from 
which they are formed. Thus, tho compounds may be described as the reaction product of a tag reactant, a linker 
reactant and a MO! reactant, as follows. 

[0041] The tag reactant consists of a chemical handle (T h ) and a variable component (T vc ), so that the tag reactant 
is seen to have the general structure; 

T ,c- T l, 

To illustrate this nomenclature, reference may be made to structure (iii), which shows a tag reactant that may be used 
to prepare the compound of structure (ii). The tag reactant having structure (iii) contains a tag variable component and 
a tag handle, as shown below: 



Structure (iii) 




Tag Variable Tag 
Component Handle 



[0042] In structure (iii), the tag handle (-C(=0)-A) simply provides an avenue for reacting the tag reactant with the 
linker reactant to form a T-L moiety. The group "A" in structure (Iii) indicates that the carboxyl group is in a chemically 
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active state, so it is ready for coupling with other handles. 'A* may be, for example, a hydroxyl group or pentafluoroph- 
enoxy, among many other possibilities. The invention provides for a large number of possible tag handles which may 
be bonded to a tag variable component, as discussed in detail below. The tag variable component is thus a part of "T 
in the formula T-L-X, and will also be part of the tag moiety that forms from the reaction that cleaves L. 
[0043] As also discussed in detail below, the tag variable component is so-named because, in preparing sets of 
compounds according to the invention, it is desired that members of a sot have unique variable components, so that 
the individual members may be distinguished from one another by an analytical technique. As one example, the tag 
variable component of structure (iii) may be one member of the following set, where members of the set may be 
distinguished by their UV or mass spectra: 




[0044] Likewise, the linker reactant may be described in terms of its chemical handles (there are necessarily at least 
two each of which may be designated as L h ) which flank a tinker labile component, where the linker labile component 
consists of the required labile moiety (L 2 ) and optional labile moseties (U and L 3 ), where the optional labile moieties 
effectively serve to separate I 2 from the handles L h , and the required labile moiety serves to provide a labile bond 
within the linker labile component. Thus, the linker reactant may be seen to have the general formula: 

L h -L 1 -L 2 -L 3 -L h 

[0045] The nomenclature used to describe the linker reactant may be illustrated in view of structure (iv) which again 
draws from the compound of structure (ii): 



Structure (iv) 



Linker 
Handle 




[0046] As structure (iv) illustrates, atoms may serve in more than one functional role. Thus, in structure (iv), the 
benzyl nitrogen functions as a chemical handle in allowing the linker reactant to join to the tag reactant via an amide- 
forming reaction, and subsequently also serves as a necessary part of the structure of the labile moiety L 2 in that the 
benzylic carbon-nitrogen bond is particularly susceptible to photolytic cleavage. Structure (iv) also illustrates that a 
linker reactant may have an L 3 group (in this case, a methylene group), although not have an L 1 group. Likewise, linker 
reactants may have an L 1 group but not an L 3 group, or may have L 1 and L 3 groups, or may have neither of L 1 nor L 3 
groups. In structure (iv), the presence of the group "P" next to the carbonyl group indicates that the carbonyl group is 
protected from reaction. Given this configuration, the activated carboxyl group of the tag reactant (iii) may cleanly react 
with the amine group of the linker reactant (iv) to form an amide bond and give a compound of the formula T-L-L h . 
[0047] The MOI reactant is a suitably reactive form of a molecule of interest. Where the molecule of interest is a 
nucleic acid fragment, a suitable MOI reactant is a nucleic acid fragment bonded through its 5' hydroxyl group to a 
phosphodiester group and then to an alkylene chain that terminates in an amino group. This amino group may then 
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react with the carbonyl group of structure (iv), (alter, of course, deprotecting the carbonyl group, and preferably after 
subsequently activating the carbonyl group toward reaction with the amine group) to thereby join the MOI to the linker. 

[0048] Whon viewed in a chronological order, the- invention is seen to take a tag reactant (having \ chemical tag 
handle and a tag variable component), a linker reactant (having two chemical linker handles, a required labile moiety 

5 and 0-2 optional labile moieties) and a MOI reactant (having a molecule of interest component and a chemical molecule 
of interest handle) to form T-L-MOI. Thus, to form T-L-MOI, either the tag reactant and the linker reactant are first 
reacted together to provide T-L-L h , and then the MOI reactant is reacted with T-L-L h so as to provide T-L-MOI, or else 
(less preferably) the linker reactant and the MOI reactant are reacted together first to provide L h -L-MOI, and then L h - 
L-MO! is reacted with the tag reactant to provide T-L-MOI. For purposes of convenience, compounds having the formula 

to T-L-MOI will be described in terms of the tag reactant, the linker reactant and the MOI reactant which may be used to 
form such compounds. Of course, the same compounds of formula T-L-MOI could be prepared by other (typically, more 
laborious) methods, and still fall within the scope of the inventive T-L-MOI compounds. 

[0049] In any event, the invention provides that a T-L-MOI compound be subjected to cleavage conditions, such that 
a tag moiety is released from the remainder of the compound. The tag moiety will comprise at least the tag variable 
component, and will typically additionally comprise some or all of the atoms from the tag handle, some or all of the 
atoms from the linker handle that was used to join the tag reactant to the linker reactant, the optional labile moiety L 1 
if this group was present in T-L-MOI, and will perhaps contain some part of the required labile moiety L 2 depending on 

...u r . « ... v_ .... fc , . ^. ., ^..w.a^ ^. /. . „ .. .;. , j j referred 

to as the T-containing moiety because T will typically constitute the major portion (in terms of mass) of the tag moiety. 
20 [0050] Given this introduction to one aspect of the present invention, the various components T, L and X will be 
described in detail. This description begins with the following definitions of certain terms, which will be used hereinafter 
in describing T, L and X. 

[0051] As used herein, the term "nucleic acid fragment" means a molecule which is complementary to a selected 
target nucleic acid molecule {i.e., complementary to all or a portion thereof), and may be derived from nature or syn- 

-5 theticaily or rocombinantly produced, including non-naturally occurring molecules, and may be in double or single 
stranded form whore appropriate; and includes an oligonucleotide (e.g., DNA or RNA), a primer, a probe, a nucleic 
acid analog {e.g., PNA), an oligonucleotide which is extended in a 5' to 3' direction by a polymerase, a nucleic acid 
which is cleaved chemically or on^/matiealiy, a nucleic acid that is terminated with a dideoxy terminator or capped at 
the 3' or 5' end with a compound that prevents pclymorization at the 5' or 3* end. and combinations thereof. The 

7 c complementarity of a nucleic acid fragment to a selected target nucleic acid molecule generally means the exhibition 
of at ieast about 70% specific base pairing throughout the length of the fragment. Preferably the nucleic acid fragment 
exhibits at loast about 30% specific base pairing; and most preferably at least about 90%. Assays for determining the 
percent mismatch (and thus the percent specific base pairing) are well known in the art and are based upon the percent 
mismatch as a function of the Tm when referenced to the fully base paired control. 

os [0052] As used herein, the term "alky!, 11 alone or in combination, refers to a saturated, straight-chain or branched- 
chain hydrocarbon radical containing from 1 to 10, preferably from 1 to 6 and more preferably from f to 4, carbon 
atoms. Examples of juch radicals include, but an net limited to, methyl, ethyl, n -propyl, iso-prcpyl, n-butyl, iso butyl, 
sec-butyl, tert-butyt, pentyl, iso-amyl, hexyl, docyl and the like. The term "alkylene" refers to a saturated, straight-chain 
or branched chain hydrocarbon diradical containing from 1 to 10, preferably from 1 to 6 and more preferably from 1 to 

40 4, carbon atoms. Examples of such diradicals include, but are not limited to, methylene, ethylene (-CH 2 -CH 2 -) T propyl- 
ene, and the like. 

[0053] The term "alkenyl, ' alone or in combination, refers to a straight-chain or branched-chain hydrocarbon radical 
having at least one carbon-carbon double bond in a total of from 2 to 10, preferably from 2 to 6 and more preferably 
from 2 to 4, carbon atoms. Examples of such radicals include, but are not limited to, ethenyl, E- and Z-propenyl, iso- 
propenyl, E- and Z-butenyl, E- and Z-isobutenyl, E- and Z-pentenyl, decenyf and the like. The term "alkenylene" refers 
to a straight-chain or branched-chain hydrocarbon diradical having at least one carbon-carbon double bond in a total 
of from 2 to 10, preferably from 2 to S and more preferably from 2 to 4, carbon atoms. Examples of such diradicals 
include, but are not limited to, methylidene (=CH 2 ), ethylidene (-CH-CH-), propylidene (-CH 2 -CH=CH-) and the like. 
[0054] The term "alkynyl," aione or in combination, refers to a straight-chain or branched-chain hydrocarbon radical 

so having at least one carbon-carbon triple bond in a total of from 2 to 1 0, preferably from 2 to 6 and more preferably from 
2 to 4, carbon atoms. Examples of such radicals include, but are not limited to, ethynyl (acetylenyl), propynyl (propargy f) , 
butynyl, hexynyl, decynyl and the like. The term "alkynylene", alone or in combination, refers to a straight-chain or 
branched-chain hydrocarbon diradical having at least one carbon-carbon triple bond in a total of from 2 to 1 0, preferably 
from 2 to 5 and more preferably from 2 to 4, carbon atoms. Examples of such radicals include, but are not limited, 

55 ethynylene (-C-C-), propynylene (-CH 2 -C=C-) and the like. 

[0055] The term "cycloalkyl," alone or in combination, refers to a saturated, cyclic arrangement of carbon atoms 
which number from 3 to 8 and preferably from 3 to 6, carbon atoms. Examples of such cycloalkyl radicals include, but 
are not limited to, cyclopropyl, cyclobuty!, cyclopentyl, cyclohexyl and the like. The term 'cycloalkylene' refers to a 
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diradical form of a cycloalkyl. 

[0056] The term "cycloalkenyl," alone or in combination, refers to a cyclic carbocycle containing from 4 to 8, preferably 

5 or 6, carbon atoms and one or mere double bonds. Examples of such cyclcalkenyl radicals include, but are not limited 
to, cyclopentenyl, cyclohexenyl, cyclopentadienyl and the like. The term "cycloalkenylene" refers to a diradical form of 
a cycioalkenyl. 

[0057] The term "aryf refers to a carbocyclic (consisting entirely of carbon and hydrogen) aromatic group selected 
from the group consisting of phenyl, naphthyl, indeny!, indanyl, azulenyl, fluorenyl, and anthracenyl or a heterocyclic 
aromatic group selected from the group consisting of fury!, thienyl, pyridyl, pyrrolyl, oxazofyly, thiazolyl, imidazolyl, 
pyrazolyi, 2-pyrazolinyi, pyrazolidinyl, isoxazolyl, isothiazolyl, 1, 2, 3-oxadiazolyl, 1, 2, 3-triazoiyl, 1, 3, 4-thiadiazolyl, 
pyridazinyl, pyrimidinyl, pyrazinyl, 1, 3, 5-triazinyl, 1, 3, 5-trithianyl, indolizinyl, indolyl, isoindolyl, 3H-indolyl, indolinyl, 
benzo[b]furanyl, 2, 3-dihydrobenzofuranyl, benzojbjthiophenyl, 1 H-indazolyl, benzimidazolyl, benzthiazolyl, purinyl, 
4H-quinoiizinyl, quinolinyl, isoquinolinyl, cinnolinyl, phthalazinyl, quinazolinyl, quinoxalinyl, 1, 8-naphthyridinyl, pterid- 
inyl, carbazolyl, acridinyl, phenazinyl, phenothiazinyl, and phenoxazinyl. 

[0058] "Aryl" groups, as defined in this application may independently contain one to four substituents which are 
independently selected from the group consisting of hydrogen, halogen, hydroxyl, amino, nitro, trifluoromethyl, trifluor- 
omethoxy, alkyl, alkenyl, alkynyl, cyano, carboxy, carboalkoxy, 1 ,2-dioxyethylene, alkoxy, alkenoxy or alkynoxy, 
alkylamino, alkenylamino, alkynylamino, aliphatic or aromatic acyl, alkoxy-carbonylamino, alkylsulfonylamino, mor- 

alkoxyurea; N-hydroxylurea; N-alkenylurea; N ; N-(alkyl, hydroxyurea; heterocyclyl; thioaryloxy-substituted aryl; ISI.N- 
(aryl, alkyl)hydrazino; Ar'-substituted sulfonylheterocyclyl; aralkyl-substituted heterocyclyl; cycloalk, and cycloakenyl- 
substituted heterocyclyl; cycloalkyt-fused aryl; aryloxy-substituted alkyl; heterocyclylammo; aliphatic or aromatic 
acylaminocarbonyl; aliphatic or aromatic acyl-substitutod alkenyl; Ar'-substituted aminocarbonyloxy; Ar', Ar'-disubsti- 
tuted aryl; aliphatic or aromatic acyl-substituted acyl; cycioalkylcarbonylalkyl; cycloalkyl-substituted amino: aryloxycar- 
bonylalkyl; phosphorcdiamidyl acid or ester; 

[0059] " Ar"' is a carbocyclic or heterocyclic aryl group as defined above having one to three substituents selected 
from the group consisting of hydrogen, halogen, hydroxyl, amino, nitro, trifluoromethyl, trifluoromethoxy, alkyl, alkenyl, 
aikynyl, 1 ,2-dioxymethylene, 1 ,2-dioxyethylono, alkoxy, alkenoxy, alkynoxy, alkylamino, alkenylamino or aikynylamino, 
alkylcarbcnyloxy, aliphatic or aromatic acyl, alkyicarbonylamino, aikoxycarbonylamino alkylsulfonylamino, N-alkyl or 
N,N-dialkyl urea. 

[0060] The term "alkoxy," alone or in combination, refers to an alkyl ether radical, wherein the term "alkyl" is as 
defined above. Examples of suitable aikyl ether radicals include, but are not limited to, methoxy, ethoxy, n-propcxy, 
iso-propoxy, n-butoxy, iso-butoxy, sec-butoxy, tert-butoxy and the like. 

[0061] The term "alkenoxy," alone or in combination, refers to a radical of formula alkenyl-O-, wherein the term 
"alkenyl" is as defined above provided that the radical is not an enol ether. Examples of suitable alkenoxy radicals 
include, but are not limited to, ailyloxy, E- and Z-3-methyl-2-propenoxy and the like. 

[0062] The term "alkynyloxy," alone or in combination, refers to a radical of formula alkynyl-O, wherein the term 
'aikynyl" is as defined above provided that the radical is not an ynol ether. Examples of suitable alkynoxy radicals 
include, but are not limited to, propargyloxy, 2-butynyloxy and the like. 

[0063] The term "thioalkoxy" refers to a thioether radical of formula alkyl-S-, wherein alkyl is as defined above. 
[0064] The term "alkylamino," alone or in combination, refers to a mono- or di-alkyl-substituted amino radical (i.e., 
a radical of formula alkyl-NH- or (alkyl) 2 -N-), wherein the term "alkyl" is as defined above. Examples of suitable alkylami- 

fio fauicaio ificiuoe, out aro not limited ic, mothyiainino. eihyiaminu, propylamine, isopropyiamino, t-butyiamino, N,N- 
diethylamino and the like. 

[0065] The term "alkenylamino." alone or in combination, refers to a radical of formula alkenyf-NH- or (afkenyl) 2 N-, 
wherein the term "alkenyl" is as defined above, provided that the radical is not an enamine. An example of such alke- 
nylamino radicals is the allylamino radical. 

[0066] The term "aikynylamino," alone or in combination, refers to a radical of formula alkynyl-NH- or (aikynyl) 2 N-, 
wherein the term "alkynyl" is as defined above, provided that the radical is not an ynamine. An example of such alky- 
nylamino radicals is the propargyl amino radical. 

[0067] The term "amide" refers to either -N(R 1 )-C(^0)- or -C(=0)-N(R 1 )-where R 1 is defined herein to include hy- 
drogen as well as other groups. The term "substituted amide" refers to the situation where R 1 is not hydrogen, while 
the term "unsubstituted amide" refers to the situation where R 1 is hydrogen. 

[0068] The term "aryloxy," alone or in combination, refers to a radical of formula aryl-O-, wherein aryl is as defined 
above. Examples of aryloxy radicals include, but are not limited to, phenoxy, naphthoxy, pyridyloxy and the like. 
[0069] The term "arylamino," alone or in combination, refers to a radical of formu!aaryl-NH-, wherein aryl is as defined 
above. Examples of arylamino radicals include, but are not limited to, phenylamino (anilido), naphthylamino, 2-, 3- and 
4-pyridylamino and the like. 

[0070] Tho term "aryf-fuscd cycicalkyl," alone or in combination, refers to a cycloalkyl radical which shares two ad- 
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jacent atoms with an aryl radical, wherein the terms "cycloalkyl" and "aryl" are as defined above. An example of an 
aryl-fused cycloalkyl radical is the benzofused cyclobutyl radical. 

[0071] The term "alkyicarbonylamino," alone or in combination, refers to a radical of formula alkyl-CONH, wherein 
the term "alkyl" is as defined above. 

[0072] Thetorm "alkoxycarbonylammo," alone or in combination, refers to a radical of formula alkyl-OCONH-, wherein 
-the term "alkyl" is as defined above. 

[0073] The term "alkylsulfonylamino/' alone or in combination, refers to a radical of formula a!kyl-S0 2 NH-, wherein 
the term "alkyi" is as defined above. 

[0074] The term "arylsulfonylarnino," alone or in combination, refers to a radical of formula aryl-SQ 2 NH-, wherein 
the term "aryl' 1 is as defined above. 

[0075] The term "N-alkylurea," alone or in combination, refers to a radical of formula alkyl-NH-CO-NH-, wherein the 
term "alkyi" is as defined above. 

[0076] The term "N-arylurea," alone or in combination, refers to a radical of formula aryl-NH-CONH-, wherein the 
term "aryl" is as defined above. 

[0077] The term "halogen" means fluorine, chlorine, bromine and iodine. 

[0078] The term "hydrocarbon radical" refers to an arrangement of carbon and hydrogen atoms which need only a 
single hydrogen atom to be an independent stable molecule. Thus, a hydrocarbon radical has one open valence site 
on a carbon atom, through which the hydrocarbon radical may be bonded to other atom(s). Alkyl, alkenyl, cycloalkyl, 
etc. are examples of hydrocarbon radicals. 

[0079] The term "hydrocarbon d, radical" refers to an arrangement of carbon and hydrogen atoms which need two 
hydrogen atoms in order to be an independent stable molecule. Thus, a hydrocarbon radical has two open valence 
sites on one or (wo carbon atoms, through ^J-vrh the hydrocarbon radical may be bonded to other atom(s). Alkylene, 
aikenylono, alkynylone, cycloalkylene, etc. are examples of hydrocarbon diradicals. 

[0080] The term "hydrocarbyl" refers to any stable arrangement consisting entirely of carbon and hydrogen having 
a single valence site to which it is bonded to another moiety, and thus includes radicals known as alkyi, alkonyl, alkynyl, 
cycloalkyl, cycloalkenyl, aryl (without heteroatom incorporation into the aryl ring), arylalkyl, alkylaryi and the like. Hy- 
drocarbon radical is another name for hydrocarbyl 

[0081] The term "hydrocarbylene" refers to any stable arrangement consisting entirely of carbon and hydrogen having 
two valence sites to which it is bonded to other moieties, and thus includes alkylene. alkenylene, alkynylene, cy- 
cloalkylene, cycloalkenylene, arylene (without heteroatom incorporation into the arylene ring), arylalkylene, aiky- 
larylene and the like. Hydrocarbon diradica! is another name for hydrocarbylene. 

[0082] The term "hydrocarbyl-O-hydrccarbylene" refers to a hydrocarbyl group bonded to an oxygen atom, where 
the oxygon atom is likewise bonded to a hydrocarbylene group at one of the two valence sites at which the hydrocar- 
bylene group is bonded to other moieties. The terms "hydrocarbyl-S-hydrocarbylene", Ttydrocarbyl-NH-hydrocarb- 
ylene" and "hydrocarbyl-amide-hydrocarbylene" have equivalent meanings, where oxygen has been replaced with 
sulfur, -NH- or an amide group, respectively. 

[0083] The term N-(hydrocarbyl)hydrocarby!ene refers to a hydrocarbylene group wherein one of the two valence 
sites is bonded to a nitrogen atom, and that nitrogen atom is simultaneously bonded to a hydrogen and a hydrocarbyl 
group. The term N,N-di(hydrocarbyl)hydrocarbylene refers to a hydrocarbylene group wherein one of the two valence 
sites is bonded to a nitrogen atom, and that nitrogen atom is simultaneously bonded to two hydrocarbyl groups. 
[0084] The term "hydrocarbylacyl-hydrcc a.Ly :r " .. .: : \z a hydrocarbyl group bonaed througn an acyi (-C(=Q)-) 
group to one of the two valence sites of a hydrocarbylene group. 

[0085] The terms "heterocyclylhydrocarbyl" and "heterocylyl" refer to a stable, cyclic arrangement of atoms which 
include carbon atoms and up to four atoms (referred to as heteroatoms) selected from oxygen, nitrogen, phosphorus 
and sulfur. The cyclic arrangement may be in the form of a monocyclic ring of 3-7 atoms, or a bicyciic ring of 8-11 
atoms. The rings may be saturated or unsaturated (including aromatic rings), and may optionally be benzofused. Ni- 
trogen and sulfur atoms in the ring may be in any oxidized form, including the quaternized form of nitrogen. A hetero- 
cyclylhydrocarbyl may be attached at any endocyclic carbon or heteroatom which results in the creation of a stable 
structure. Preferred heterocyclylhydrocarbyls include 5-7 membered monocyclic heterocycles containing one or two 
nitrogen heteroatoms. 

[0086] A substituted heterocyclyfhydrocarbyl refers to a heterocyclylhydrocarbyl as defined above, wherein at feast 
one ring atom thereof is bonded to an indicated substituent which extends off of the ring. 

[0037] In referring to hydrocarbyl and hydrocarbylene groups, the term "derivatives of any cf the foregoing wherein 
one or more hydrogens is replaced with an equal number of fluorides" refers to molecules that contain carbon, hydrogen 
and fluoride atoms, but no other atoms. 

[0088] The term "activated ester" is an ester that contains a "leaving group" which is readily displaceable by a nu- 
cleophile, such as an amine, and alcohol or a thiol nucleophile. Such leaving groups are well known and include, without 
limitation, N-hydroxysuccinimide, N-hydroxybenzotriazole, halogen (haltdes), alkoxy including tetrafluorophenolates, 
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thioalkoxy and the Ilka. The term "protected ester" refers to an ester group that is masked or otherwise unreactive. 
See, e.g., Greene, "Protecting Groups In Organic Synthesis." 

[0089] In view of the above definitions, other chemical terms used throughout this application can be easily under- 
stood by those of skiil in the art. Terms may oe used alone or in any combination thereol. The preferred and more 
preferred chain lengths of the radicals apply to all such combinations. 

A. GENERATION OF TAGGED NUCLEIC ACID FRAGMENTS 

[0090] As noted above, one aspect of the present invention provides a general scheme for DNA sequencing which 
allows the use of more than 16 tags in each lane; with continuous detection, the tags can be detected and the sequence 
read as the size separation is occurring, just as with conventional fluorescence-based sequencing. This scheme is 
applicable to any of the DNA sequencing techniques based on size separation of tagged molecules. Suitable tags and 
linkers for use within the present invention, as well as methods for sequencing nucleic acids, are discussed in more 
detail betow. 

1 . Tags 

[0091] "Tag", as used herein, generally refers to a chemical moiety which is used to uniquely identify a "molecule of 
interest", and more specifically refers to the tag variable component as well as whatever may be bonded most closely 
to it in any of the tag reactant, tag component and tag moiety. 
[0092] A tag which is useful in the present invention possesses several attributes: 

1 ) It is capable of being distinguished from all other tags. This discrimination from other chemical moieties can be 
based on the chromatographic behavior of the tag (particularly after the cleavage reaction), its spectroscopic or 
potentiometric properties, or some combination thereof. Spectroscopic methods by which tags are usefully distin- 
guished include mass spectroscopy (MS), infrared (IR), ultraviolet (UV), and fluorescence, where MS, IR and UV 
are preferred, and MS most preferred spectroscopic methods. Potentiometric amperometry is a preferred poten- 
tiometric method. 

2) The tag is capable of being detected when present at 10" 22 to 10" 6 mole. 

3} The tag possesses a chemical handle through which it can be attached to the MOI which the tag is intended to 
uniquely identify. The attachment may be made directly to the MOI, or indirectly through a 'linker" group 

4) The tag is chemically stable toward ail manipulations to which it is subjected, including attachment and cleavage 
from the MOI, and any manipulations of the MOI while the tag is attached to it. 

5) The tag does not significantly interfere with the manipulations performed on the MOI while the tag is attached 
to it. For instance, if the tag is attached to an oligonucleotide, the tag must not significantly interfere with any 
hybridization or enzymatic reactions {e.g., PCR sequencing reactions) performed on the oligonucleotide. Similarly, 
if the tag is attached to an antibody, it must not significantly interfere with antigen recognition by the antibody. 

[0093] A tag moiety which is intended to be detected by a certain spectroscopic or potentiometric method should 
possess properties which enhance the sensitivity and specificity of detection by that method. Typically, the tag moiety 
will have those properties because they have been designed into the tag variable component, which will typically 
constitute the major portion of the tag moiety. In the following discussion, the use of the word "tag" typically refers to 
the tag moiety (i.e., the cleavage product that contains the tag variable component), however can also be considered 
to refer to the tag variable component itself because that is the portion of the tag moiety which is typically responsible 
for providing the uniquely detectable properties, in compounds of the formula T-L-X, the "T" portion will contain the tag 
variable component. Where the tag variable component has been designed to be characterized by, e.g., mass spec- 
trometry, the "T" portion of T-L-X may be referred to as T ms . Likewise, the cleavage product from T-L-X that contains 
T may be referred to as the T ms -containing moiety. The following spectroscopic and potentiometric methods may be 
used to characterize T ms -containing moieties. 

a. Characteristics of MS Tags 

[0094] Where a tag is analyzable by mass spectrometry (i.e , is a MS-readab!e tag, also referred to herein as a MS 
tag or H T ms -containing moiety"), the essential feature of the tag is that it is able to be ionized. It is thus a preferred 
element in the design of MS-readable tags to incorporate therein a chemical functionality which can carry a positive 
or negative charge under conditions of ionization in the MS. This feature confers improved efficiency of ion formation 
and greater overall sensitivity of detection, particularly in electrospray ionization. The chemical functionality that sup- 
ports an ionized charge may derive from T ms or L or both. Factors that can increase the relative sensitivity of an analyte 
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being detected by mass spectrometry are discussed in, e.g., Sunnor, J., et al. Anal. Cham. 60.1300-1307 (1988). 
[0095] A preferred functionality to facilitate the carrying of a negative charge is an organic acid, such as phenolic 
hydroxy), carboxylic acid, phosphonate, phosphate, tetrazoie, sulfonyl urea, perftuoro alcohol and sulfonic acid. 
[0096] Preferred functionality to facilitate the carrying of a positive charge under ionization conditions are aliphatic 
or aromatic amines. Examples of amine functional groups which give enhanced detectability of MS tags include qua- 
ternary amines (i.e., amines that have four bonds, each to carbon atoms, see Aebersold, U.S. Patent No. 5,240,859) 
and tertiary amines (i.e., amines that have three bonds, each to carbon atoms, which includes C-N-C groups such as 
are present in pyridine, see Hess et al., Anal. Biochom. 224:373, 1995; Bures et a!., Anal. Biochom. 224:364, 1995). 
Hindered tertiary amines are particularly preferred. Tertiary and quaternary amines may be alky! or aryl. A T ms -con- 
taining moiety must bear at least one ionizabie species, but may possess more than one ionizable species. The pre- 
ferred charge state is a single ionized species per tag. Accordingly, it is preferred that each T ms -containing moiety (and 
each tag variable component) contain only a single hindered amine or organic acid group. 
[0097] Suitable amine-containing radicals that may form part of the T ms -containing moiety include the following: 
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[0098] Tho identification of a tag by mass spectrometry is preferably based upon its molecular mass to charge ratio 
(m/z). The preferred molecular mass range of MS tags is from about 100 to 2,000 daltons, and preferably the T 113 - 
to containing moiety has a mass of at least about 25Q daltons, more preferably at least about 300 daltons, and still more 
preferably at least about 350 daltons. It is generally difficult for mass spectrometers to distinguish among moieties 
having parent ions below about 200-250 daltons (depending on the precise instrument), and thus preferred ^-con- 
taining moieties of the invention have masses above that range. 

[0099] As explained above, the T ms -containing moiety may contain atoms other than those present in the tag variable 
?S component, and indeed other than present in T ms itself. Accordingly, the mass of T ms itself may be less than about 250 
daltons, so long as ihe T ms -ccntaining moiety has a mass of at least about 250 daltons. Thus, the mass of T ms may 
range from 1 5 (i.e., a methyl radical) to about 10,000 daltons, and preferably ranges from 100 to about 5,Q0G daltons, 
and more preferably ranges (rom about 200 to about 1,000 daltons. 

[0100] It is relatively difficult to distinguish tags by mass spectrometry when those tags incorporate atoms that have 
20 more than one isotope in significant abundance. Accordingly, preferred T groups which are intended for mass spec- 
troscopic identification (T WG groups), contain carbon, at least one of hydrogen and fluoride, and optional atoms selected 
from oxygen, nitrogen, sulfur, phosphorus and iodine. While other atoms may be present in the T™ 3 , their presence 
can render analysis of the mass spectral data somewhat more difficult. Preferably, the T™ 3 groups have only carbon, 
nitrogen and oxygen atoms, in addition to hydrogen and/or fluoride. 
m [0101] Fluoride is an optional yet preferred atom to have in a T mci group In comparison to hydrogen, fluoride is, of 
course, much heavier. Thus, the presenco of fluoride atoms rather than hydrogen atoms leads to T ma groups of higher 
mass, thereby allowing the T™' 3 group to reach and exceed a mass of greater than 250 daltons, which is desirable as 
explained above, In addition, the replacement of hydrogen with fluoride confers greater volatility on the T ma -containing 
moiety, and greater volatility of the analyte enhances sensitivity when mass spectrometry is being used as the detection 
w method. 

[0102] The molecular formula of T ma falls within the scope of C,. 5 Q a N 0 ., 00 O 0M00 S 0 . t0 F 0 . 10 H a l s wherein the sum 
of a, |1 and 6 is sufficient to satisfy Ihe otherwise unsatisfied valencies of the C, N, O, S and P atoms. The designation 
Ct-"-ooNo.|o0 0 ., 00 S 0 . 10 P 0 _ 10 H a F, i l (5 means that T™ z contains at least one, and may contain any number from 1 to 500 
carbon atoms, in addition to optionally containing as many as 100 nitrogen atoms ("N 0 _" means that T ms need not 

35 contain any nitrogen atoms), and as many as 100 oxygen atoms, and as many as 10 sulfur atoms and as many as 10 
phosphorus atoms. The symbols a, \\ and 5 represent the number of hydrogen, fluoride and iodide atoms in T ms , where 
any two of these numbers may be zero, and where the sum of these numbers equals the total of the otherwise unsatisfied 
valencies of the C, N, O, S and P atoms. Preferably, T ms has a molecular formula that falls within the scope of 
c i-so N o-io°o-io H a F p where the sum of a and p equals the number of hydrogen and fluoride atoms, respectively, 

40 present in the moiety. 

b. Characteristics of IR Tags 

[0103] There are two primary forms of IR detection of organic chemical groups: Raman scattering IR and absorption 
^5 |R. Raman scattering IR spectra and absorption IR spectra are complementary spectroscopic methods. In general, 
Raman excitation depends on bond polarizability changes whereas IR absorption depends on bond dipole moment 
changes. Weak IR absorption lines become strong Raman lines and vice versa. Wavenumber is the characteristic unit 
for IR spectra. There are 3 spectral regions for IR tags which have separate applications: near IR at 12500 to 4000 
cm* 1 , mid IR at 4000 to 600 cm' 1 , far I R at 600 to 30 cm* 1 . For the uses described herein where a compound is to serve 
so as a tag to identify an MOI, probe or primer, the mid spectral regions would be preferred. For example, the carbenyl 
stretch (1850 to 1750 cm" 1 ) would be measured for carboxylic acids, carboxylic esters and amides, and alkyl and aryl 
carbonates, carbamates and ketones. N-H bending (1750 to 160 enr 1 ) would be used to identify amines, ammonium 
ions : and amides. At 1400 to 1250 cm -1 , R-OH bending is detected as well as the C-N stretch in amides. Aromatic 
substitution patterns are detected at 900 to 690 cm' 1 (C-H bending, N-H bending for ArNH 2 ). Saturated C-H, olefins, 
55 aromatic rings, double and triple bonds, esters, acetals, ketals, ammonium salts, N-O compounds such as oximes, 
nitro, N-oxides, and nitrates, azo, hydrazones, quinones, carboxylic acids, amides, and lactams all possess vibrational 
infrared correlation data (see Pretsch et aL, Spectral Data for Structure Determination of Organic Compounds, Spfinger- 
Verlag, New York, 1989). Preferred compounds would include an aromatic nitrile which exhibits a very strong nitrite 
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stretching vibration at 2230 to 2210 cm* 1 . Other useful types of compounds are aromatic alkynes which have a strong 
stretching vibration that gives rise to a sharp absorption band between 2140 and 2100 cm* 1 . A third compound type is 

the aromatic azides which exhibit an intense absorption band in the 2160 to 2120 cm* 1 region Thiocyanates are rep- 
resentative of compounds that have a strong absorption at 2275 to 2263 cm* 1 . 

5 

c. Characteristics of UV Tags 

[0104] A compilation of organic chromophore types and their respective UV-visiblo properties is given in Scott (In- 
terpretation of the UV Spectra of Natural Products, Pormagon Press, New York, 1962). A chromophore is an atom or 

io group of atoms or electrons that are responsible for the particular light absorption. Empirical rules exist for the;t to 7t* 
maxima in conjugated systems (see Pretsch et a!., Spectra! Data for Structure Determination of Organic Compounds, 
p. B65 and B70, Springer- Verlag, New York, 1989). Preferred compounds (with conjugated systems) would possess 
n to 7t* and it to re* transitions. Such compounds are exemplified by Acid violet 7, Acridine Orange, Acridine Yellow G, 
Brilliant Blue G, Congo Red, Crystal Violet, Malachite Green oxalate, Metanil Yellow ; Methylene Blue, Methyl Orange, 

is Methyl Violet B, Naphtol Green B, Oil Blue N, Oil Red O, 4-phenylazophenol, Safranie O, Solvent Green 3. and Sudan 
Orange G, all of which are commercially available (Aldricrr Milwaukee, Wl). Other suitable compounds are listed in, 
e.g.. Jane, I., otal,, J. Chrom. 323:191-225 (1985). 

J. Ciiajactenstic of a Fluorescent Tag 

20 

[0105] Fluorescent probes are identified and quantitated most directly by their absorption and fluorescence emission 
wavelengths and intensities. Emission spectra (fluorescence and phosphorescence) are much mo e sensitive and 
permit more specific measurements than absorption spectra. Other photcphysical characteristics such as excited-stato 
lifetime and fluorescence anisotropy are less widely used. The most generally useful intensity parameters are the molar 

2S extinction coefficient (t) for absorption and the quantum yield (QY) for fluorescence. The value of e is specified at a 
single wavelength (usually the absorption maximum of the probe), whereas QY is a measure of the total photon emission 
over the entire fluorescence spectral profile. A narrow optical bandwidth (<20 nm) is usually used for fluorescence 
excitation (via absorption), whereas the fluorescence detection bandwidth is much more variable, ranging from full 
spectrum for maximal sensitivity to narrow band (-20 nm) for maximal resolution. Fluorescence intensity per probe 

Jo molecule is proportional to the product of e and QY. The range of these parameters among fluorophores of current 
practical importance is approximately 1 0,000 to 1 00,000 crrr 1 M- 1 for e and 0. 1 to 1 0 for QY. Compounds that can serve 
as fluorescent tags are as follows: fluorescein, rhodamine, lambda blue 470, lambda green, lambda red 664, lambda 
red 665, acridine orange, and propidium iodide, which are commercially available from Lambda Fluorescence Co. 
(Pleasant Gap, PA). Fluorescent compounds such as nile red, Texas Red, lissamino™, BODIPY™ s are available from 

3$ Molecular Probes (Eugene, OR). 

e. Characteristics of Potentiometric Tags 

[0106] The principle of electrochemical detection (ECD) is based on oxidation or reduction of compounds which at 

4 o certain applied voltages, electrons are either donated or accepted thus producing a current which can be measured. 
When certain compounds are subjected to a potential difference, the molecules undergo a molecular rearrangement 
at the working electrodes' surface with the loss (oxidation) or gain (reduction) of electrons, such compounds are said 
to be electronic and undergo electrochemical reactions. EC detectors apply a voltage at an electrode surface over 
which the HPLC eluent flows. Electroactive compounds eiuting from the column either donate electrons (oxidize) or 

4 $ acquire electrons (reduce) generating a current peak in real time. Importantly the amount of current generated depends 
on both the concentration of the analyte and the voltage applied, with each compound having a specific voltage at 
which it begins to oxidize or reduce. The currently most popular electrochemical detector is the amperometric detector 
in which the potential is kept constant and the current produced from the electrochemical reaction is then measured. 
This type of spectrometry is currently called "potentiostatic amperometry". Commercial amperometers are available 

so from ESA, Inc., Chelmford, MA, 

[0107] When the efficiency of detection is 100%, the specialized detectors are termed "coulometric". Coulometric 
detectors are sensitive which have a number of practical advantages with regard to selectivity and sensitivity which 
make these types of detectors useful in an array. In coulometric detectors, for a given concentration of analyte, the 
signal current is plotted as a function of the applied potential (voltage) to the working electrode. The resultant sigmoidal 

55 graph is called the current-voltage curve or hydrodynamic voftammagram (HDV). The HDV allows the best choice of 
applied potential to the working electrode that permits one to maximize the observed signal. A major advantage of 
ECD is its inherent sensitivity with current levels of detection in the subfemtomclo range, 

[0108] N umerous chemicals and compounds are electrochemically active including many biochemicals, pharmaceu- 
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1,1 1 y^*''"''*™ nt "^ m-v— jm.-^i-Mji. . — > l >:t:-- -an bo effectively resolved oven if their half -wave 

potentials (the potential at half signal maximum) differ by only 30-60 mV. 

[0109] Recently developed coulometric sensors provide selectivity, identification and resolution of co-eluting com- 
pounds when used as dotectors in liquid chromatography based separations. Therefore, these arrayed detectors add 
another set of separations accomplished in the detector itself. Current instruments possess 16 channels which are in 
principle limited only by the rate at which data can bo acquired. The number of compounds which can be resolved on 
the EC array is chromatographically limited (/.a, plate count limited). However, if two or more compounds that chro- 
matographicaiiy co-elute have a difference in half wave potentials of 30-60 mV. the array is able to distinguish the 
compounds. The ability of a compound to be electrochemical ly active relies on the possession of an EC active group 
(i.e., -OH, -O.-N, -S). 

[0110] Compounds which have been successfully detected using coulometric detectors include 5-hydroxytryptamine, 
3-methoxy-4-hydroxyphenyl-glycol, homogentisic acid, dopamine, metanephrine, 3-hydroxykynureninr, acetami- 
nophen, 3-hydroxytryptophol, 5-hydroxyindoleacetic acid, octanesulfonic acid, phenol, o-cresoi, pyrogaliol, 2-nitroph- 
enpl, 4-nitrophenol, 2,4-dinitrophenol, 4,6-dinitrocresol, S-methyi^-nitrophenol, 2,4-dichlorophenol, 2,6-dichlorophe- 
nol, 2,4,5-trichlorophenol, 4-chiora-3-methylphenol, 5-methylphenol, 4-methy!-2-nitrophenoi, 2-hydroxyaniline, 4-hy- 
droxyaniline, 1 ,2-phenylenediarnine, benzocatechin, buturon, chlortholuron, diuron, isoproturon, linuron, methobro- 
muron, metoxuron, monolinuron, monuron, methionine, tryptophan, tyrosine, 4-aminobenzoic acid, 4-hydroxybenzoic 



acid, daidzein, datsscetin, diosmetin, epicatechin gailate, epigallo catechin, epigallo catechin gallate, eugenoi, eupa- 
torin, ferulic acid, fisetin, aalangin, gallic acid, gardenin, genistein, gentisic acid, hesperidin, irigenin, kaemferol, leu- 
coyanidin, luteolin, mangobiin, morin, myricetin, naringin, narirutin, pelargondin, peonidin, phloretm, pratensein, pro- 
tocatechuic acid, rhamnetin, quercetin, sakuranetin, scutellarein, scopolotin, syringaldehyde, syringic acid, tangeritin, 
troxerutin, umbeliiferone, vanillic acid, 1 ,3-dimethyl tetrahydroisoquinoline, 6-hydroxydopamine, r-salsolinol, N-methyl- 
r-salsolinci, tetrahydroisoquinoline, amitriptyline, apomorphine, capsaicin, chlordiazepoxide, chlorpromazino, dauno- 
rubicin, desipramine, doxepin, fluoxetine, flurazepam, imipramme, isoproterenol, methoxamine, morphine, morphine- 

3- glucuronide, nortriptyline, oxazepam, phenylephrine, trimipramme, ascorbic acid, N-acetyl serotonin, 3,4-dihydroxy- 
benzyiamine, 3,4-dihydroxymandeiic acid (DOMA), 3,4-dihydroxyphenylacetic acid (DOPAC), 3,4-dihydroxyphenyla- 
lanine (L-DOPA), 3,4-dihydroxyphenylglycol (DHPG), 3-hydroxyanthranilic acid, 2-hydroxyphenylacetic acid (2HPAC), 

4- hydroxybenzoic acid (4HBAC), 5-hydroxymdole-3-acetic acid (5HIAA), 3-hydroxykynurenine, 3-hydroxymandeiic ac- 
id, 3-hyuioxy-4-meUioxypiienylijthyiarninti, 4-hydf oxy phony iacelie ac;u i4hPAC), 4 -n ya roxy p h e ny i ia ctic acid (4HPLA), 

5- hydroxytryplophan (5HTP). 5-hydroxytryptcphoi (5HTOL), 5-hyaroxytryptamine (5HT), S-hyrjroxytryptamine sulfate, 
3-methoxy-4-hydrcxyphonylglycol (MHPG) : 5-methoxytryptamine, 5-methoxytryptophan, 5-methoxytryptophol, 
3-methoxytyramine (3MT), 3-methoxytyrosine (3-OM-DOPA), S-methylcysteine, 3-methylguanme, bufotenin, 
dopamine dopamme-3-giucuronide, dopamine-3-suifate, dcpamine-4-sulfate, epinephrine, epinine, folic acid, glutath- 
ione (reduced), guanine, guanosine, homogentisic acid (HGA). homovanillic acid (HVA), homovanillyl alcohol (HVOL), 
homoveratic acid, hva sulfate, hypoxanthine, indole, mdole-3-acetic acid, indole-3-lactic acid, kynurenme, melatonin, 
metanephrine, N-methyltryptamine, N-methyltyramine, N.N-dimethyltryptarnine, N,N-dimethyityramine, norepine- 
phrine, normetanephrine, octopamine, pyridoxal, pyridoxal phosphate, pyndoxamme, synephrine, tryptophol, tryp- 
tamine, tyramine, uric acid, vanillylmandelic acid (vma), xanthine and xanthosine. Other suitable compounds are set 
forth in, e.g., Jane, I., ot al. J. Chrom. 323:191-225 (1985) and Musch, G., et al., J. Chrom. 348:97-110 (1985). These 
compounds can be incorporated into compounds of formula T-L-X by methods known in the art. For example, com- 
pounds having a carboxyiic acid group may be reacted with amine, hydroxy!, etc. to form amide, ester and other linkages 
between T and L. 

[0111] In addition to the above properties, and regardless of the intended detection method, it is preferred that the 
tag have a modular chemical structure. This aids in the construction of large numbers of structurally related tags using 
the techniques of combinatorial chemistry. For example, the T ms group desirably has several properties. It desirably 
contains a functional group which supports a single ionized charge state when the T^-ccntaining moiety is subjected 
to mass spectrometry (more simply referred to as a "mass spec sensitivity enhancer" group, or MSSE). Also, it desirably 
can serve as one member in a family of T^-containing moieties, where members of the family each have a different 
mass/charge ratio however have approximately the same sensitivity in the mass spectrometer. Thus, the members 
of the family desirably have the same MSSE. In order to allow the creation of families of compounds, it has been found 
convenient to generate tag reactants via a modular synthesis scheme, so that the tag components themselves may 
be viewed as comprising modules. 

[0112] In a preferred modular approach to the structure of the T ms group, T ms has the formula 
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wherein T 2 is an organic moiety formed from carbon and one or more of hydrogen, fluoride, iodide, oxygen, nitrogen, 
sulfur and phosphorus, having a mass range of 15 to 500 daltons; T 3 is an organic moiety formed from carbon and 
one or more of hydrogen, fluoride, iodide, oxygen, nitrogen, sulfur and phosphorus, having a mass range of 50 to 1000 
daltons; J is a direct bond or a functional group such as amide, ester, amine, sulfide, ether, thioester, disulfide, thioether, 
urea, thiourea, carbamate, thiocarbamate, Schiff base, reduced Schiff base, imine, oxime, hydrazone, phosphate, 
phosphonate, phosphoramide, phosphonamide, sulfonate, sulfonamide or carbon-carbon bond; and n is an integer 
ranging from 1 to 50, such that when n is greater than 1 , each T 3 and J is independently selected. 
[0113] Tho modular structure T 2 -(J-T 3 ) n - provides a convenient entry to families of T-L-X compounds, where each 
member of the family has a different T group. For instance, when T is T™ 3 , and each family member desirably has the 
same MSSE, one of the T 3 groups can provide that MSSE structure. In order to provide variability between members 
of a family in terms of the mass of T m3 , the T 2 group may be varied among family members. For instance, one family 
member may have T 2 = methyl, while another has T 2 = ethyl, and another has T 2 = propyl, etc. 
[0114] In order to provide "gross" or large jumps in mass, a T 3 group may be designed which adds significant (e.g., 
one or several hundreds) of mass units to T-L-X. Such a T 3 group may be referred to as a molecular weight range 
adjuster groupfWRA"). A WRA is quite useful if one is working with a single set of T 2 groups, which will have masses 
extending over a limited range. A single set of T 2 groups may be used to create T ms groups having a wide range of 
mass simply by incorporating one or more WRA T 3 groups into the T ms . Thus, using a simple example, if a set of T 2 
groups affords a mass range of 250-340 daltons for the T™ s , the addition of a single WRA, having, as an exemplary 
number 100 daiton, as a T 3 group provides access to the mass range of 350-440 daltons while using the same set of 
T 2 groups. Similarly, the addition of two 100 daiton MWA groups (each as a T 3 group) provides access to the mass 
range of 450-540 daltons, where this incremental addition of WRA groups can be continued to provide access to a 
very large mass range for the T ms group. Preferred compounds of the formula ^-(J-T 3 -)^ -X have the formula R wc - 
( r wra)w _r msse" l " x where VWC is a "T 2 " group, and each of the WRA and MSSE groups are "T 3 " groups. This structure 
is illustrated in Figure 12, and represents one modular approach to the preparation of T ms 

[0115] In the formula T 2 -(J-T 3 -) n -, T 2 and T 3 aro preferably selected from hydrocarbyl, hydrocarbyl-O-hydrocarbylene, 
hydrocarbyl-S-hydrocarbylene, hydrocarbyl-NH-hydrocarbylene, hydrocarbyl-amide-hydrocarbylene, N-(hydrocarbyl) 
hydrocarbylene, N,N-di{hydrocaibyl)hydrocarbylone, hydrocarbylacyl-hydrocarbylcne, heterocyclylhydrocarbyl 
wherein tho hoteroatom(si run selected from oxygen, nitrogen, sulfur and phosphorus, substituted hetorocyclylhydro- 
carbyi wherein the heteroatom(s) aro selected from oxygen, nitrogen, sulfur and phosphorus and the substituents are 
selected from hydrocarbyl, hydrocarbyi-C-hydrocarbylene, hydrocarbyl-NH-hydrocarbylene, hydrccarbyl-S-hydrocar- 
bylene, N-(hydrocarbyl)hydrocarbylene, N,N-di(hydrocarbyl)hydrocarbyleno and hydrocarbylacyl-hydrocarbylene. In 
addition, T 2 and/or T 3 may be a derivative of any of tho previously listed potential T 2 / T 3 groups, such that one or more 
hydrogens are replaced fluorides. 

[0116] Also regarding the formula T 2 -(J-T 3 -) n -, a preferred T 3 has the formula -G(R 2 )-, wherein G is C,^ alkylene 
chain having a single R 2 substituont. Thus, if G is ethylene (-CH 2 -CH a -) either one of the two ethylene carbons may 
have a R 2 substituent, and R 2 is selected from alkyl, alkenyl, alkynyl, cycloalkyl, aryl-fused cycloalkyi, cycioalkonyi, 
aryi, aralky!. aryl-substituted alkenyl or alkynyl, cycloalkyl-substituted alkyl, cycloalkenyl-substituted cycloalkyl, biaryl, 
alkoxy, alkenoxy, alkynoxy, aralkcxy, aryl-substituted alkenoxy or alkynoxy, alkylamino, alkenylamino or alkynylamino, 
aryl-substituted alkylamino, aryl-substituted alkenylamino or alkynylamino, aryioxy, arylamino, N-alkylurea-substituted 
alkyl. N-arylurea-substitutod alkyl, alkylcarbonylamino-substituted alkyl. aminocarbonyl-substituted alkyl, heterocyclyl, 
heterocyclyl-substituted alkyl, heterocyciyl-substituted amino, carboxyaikyl substituted araikyl, oxocarbocyclyl-fused 
aryl and heterocyclylalkyl; cycloalkenyi, aryl-substituted alkyl and, araikyl, hydroxy-substituted alkyl, alkoxy-substituted 
alkyl, aralkoxy-substituted alkyl, alkoxy-substituted alkyl, aralkoxy-substituted alkyl, amino-substituted alkyl, (aryl-sub- 
stituted alkyloxycarbonylamino)-substituted alkyl, thiol-substituted alkyl, alkylsulfonyl-substituted alkyl, (hydroxy-sub- 
stituted aikyithio)-substituted alkyl, thioalkoxy-substituted alkyl, hydrocarbylacylamino-substituted alkyl, heterocycly- 
lacylamino-substituted alkyl, hydrocarbyl-substituted-heterocyclyiacylamino-substituted alkyl, alkylsulfonylamino-sub- 
stituted alkyl, arylsulfonylammo-substituted alkyl, morpholino-alkyl, thiomorpholino-alkyL morpholino carbonyl-substi- 
tuted alkyl, thiomorpholinocarbonyl-substituted alkyl, [N-(alkyl, alkenyl or alkynyl)- or N,N-[dialkyl, dialkenyl, dialkynyl 
or (alkyl, alkenyl)-amino]carbonyl-substituted alkyl, heterocyclylaminocarbonyl, heterocylylalkyleneaminocarbonyl, 
heterocyclylaminocarbonyl-substituted alkyl, heterocylylalkyleneaminccarbonyl-substituted alkyl, N,N-[diaikyl]alkyle- 
neaminocarbonyl, N,N-[dialkyl]a!kyleneaminocarbonyl-substituted alkyl, alkyl-substituted heterocyciyfearbonyl, alkyl- 
substituted heterocyciylcarbonyl-alkyl, carboxyl-substituted alkyl, dialkylamino-substituted acylammoalkyl and amino 
acid side chains selected from arginine. asparagine, glutamine. S-methyl cysteine, methionine and corresponding 
sulfoxide and sulfone derivatives thereof, glycine, leucine : isoleucine, allo-isoleucine, tert-leucine, norleucine, pheny- 
lalanine, tyrosine, tryptophan, proline, alanine, ornithine, histidine, glutamine, valine, threonine, serine, aspartic acid, 
beta-cyanoalanine : and allothreonine; alynyl and heterocyclylcarbonyl, aminocarbonyl, amido, mono- or dialkylamino- 
carbonyl, mono- or diarylaminocarbonyl, alkylarylaminocarbonyl, diarylaminocarbonyl, mono- or diacylaminocarbonyl, 
aromatic or aliphatic acyl, alkyl optionally substituted by substituents selected from amino, carboxy, hydroxy, mercapto, 
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mono-ordialkylamino, mono- ordiarylamino, alkylarylamino, diarylamino, mono- ordiacylamino, alkoxy, alkenoxy, ary- 
loxy, thioalkoxy, thioalkenoxy, thioalkynoxy, thioaryloxy and hetarocyclyl. 
[0117] A preferred compound of the formula "^-(J-T^-L-X has the structure: 



T 4 

f 



10 



15 




wherein G is (CH 2 ) 1 . 6 such that a hydrogen on one and only one of the CH 2 groups represented by a single M G" is 
20 replaced with-(CH 2 ) c -Amde-T 4 ; T 2 and T 4 are organic moieties of the formula C, .:5N 0 . g O (>g H a Fp such that the sum of 
a and p is sufficient to satisfy the otherwise unsatisfied valencies of the C, N, ana O atoms; amide is 



25 



35 



o o 

li II 

— N-C — or — C-N — ; 
' I 1 t 

R 1 is hydrogen or C^q alkyl; c is an integer ranging from 0 to 4; and n is an integer ranging from 1 to 50 such that 
when n is greater than 1, G, c. Amide, R 1 and T 4 are independently selected. 

[0118] In a further preferred embodiment a compound of the formula T^-(J-T 3 -) n -l_-X has the structure: 



r 
i 

Amide 



o (c r H 2)c R ' o 

II I ! II 

X 



R ' O (CH 2 ) C 

45 ^ I 



Amide 



so wherein T 5 is an organic moiety of the formula C 1 . 25 N 0 . g O 0 . 9 H 0t Fj3 such that the sum of a and |3 is sufficient to satisfy 
the otherwise unsatisfied valencies of the C, N, and O atoms; and T 5 includes a tertiary or quaternary amine or an 
organic acid; m is an integer ranging from 0-49, and T 2 , T 4 , R 1 , L*and X have been previously defined. 
[0119] Another preferred compound having the formula T2-(J-T 3 -) n -L-X has the particular structure: 



55 
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wherein T s is an organic moiety of the formula C 1 . 25 N 0 .gO 0 . 9 H a Fu such that the sum of a and p" is sufficient to satisfy 
the otherwise unsatisfied valencies of the C, N, and O atoms; and I s includes a tertiary or quaternary amine or an 
organic acid: m is an integer ranging from 0-49, and T 2 . T 4 , c, R 1 , "Amide", L and X have been previously defined. 
[0120] In the above structures that have a T s group, -Amide-T 5 is preferably one of the following, which are conven- 
iently made by reacting organic acids with free amino groups extending from "G": 



•NHC 

II 

O 




o 



N 

<C,-C, 0 ) 



NHC-(C,-C I0 )-N 



-NHC 

II 
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— <^)-O-(C 2 -C| 0 )-N(C,-C, 0 ) 2 




■NHC-(C 0 -C IO )-jf- 
O 




•NHC 

II 
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N-(C,— C 10 ); 



and — NHC— (C, — C| 0 )-N' 
O 



[0121] Where the above compounds have a T s group, and the "G" group has a free carboxyl group (or reactive 
equivalent thereof), then the following are preferred -Amide-T 5 group, which may conveniently be prepared by reacting 
the appropriate organic amine with a free carboxyl group extending from a "G" group: 



-CNH- 
II 
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[0122] In three preferred embodiments of the invention, T-L-MOI has the structure: 



25 



30 



35 




(Cj — C J0 ) — ODN — 3 — OH 



or the structure: 
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(Cj — C| 0 ) — ODN — 3 — OH 
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or the structure: 
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wherein T 2 and T 4 are organic moieties of the formula C 1 _ 25 N 0 .g0 0 . g S 0 . 3 Po.3H a Fj i l5 such that the sum of a, (3 and 5 is 
sufficient to satisfy the otherwise unsatisfied valencies of the C, N, O, S and P atoms; G is (Chy^ wherein one and 
is only one hydrogen on the CH 2 groups represented by each G is replaced with -(CH 2 ) C -Amide-T 4 ; Amide is 

o o 
ii ii 

20 — N-C— or — C-N — ; 

R> R' 

R 1 is hydrogen or C w0 alkyl; c is an integer ranging from 0 to 4: "Co^io" represents a hydrocarbylene group having 
from 2 to 10 carbon atoms, "ODN-3'-OH" represents a nucleic acid fragment having a terminal 3' hydroxyl group (i.e., 
a nucleic acid fragment joined to (C r C 10 ) at other than the 3' and of the nucleic acid fragment); and n is an integer 
ranging from 1 to 50 such that when n is greater than 1, then G, c, Amide, R 1 and T 4 are independently selected. 
Preferably there are net three heteroatoms bonded to a single carbon atom. 

[0123] In structures as set forth above that contain a l^-C^OJ-NfR 1 }- group, this group may be formed by reacting 
30 an amine of the formula HN(R 1 )- with an organic acid selected from the following, which are exemplar/ only and do 
not constitute an exhaustive list of potential organic acids: Formic acid, Acetic acid, Propiolic acid, Propionic acid, 
Fluoroacetic acid, 2-Butynoic acid, Cyclopropanecarboxylic acid, Butyric acid, Methoxyacetic acid, Difluoroacetic acid, 
4-Pentynoic acid, Cyciobutanecarboxylic acid, 3,3-Dimethylacryltc acid, Valeric acid, N,N-Dimethylglycine, N-Formyl- 
Gly-OH, Ethoxyacetic acid, (Methylthio)acetic acid, Pyrrole-2-carboxy!ic acid, 3-Furoic acid, lsoxazole-5-carboxylic 
^5 acid, trans-3-Hexenoic acid, Trifluoroacetic acid, Hexanoic acid, Ac-Gly-OH, 2-Hydroxy-2-methylbutyric acid, Benzoic 
acid.. Nicotinic acid, 2-Pyrazinecarboxylic acid, t -Methyl-2-pyrrolecarboxylic acid, 2-Cyclopentene-1 -acetic acid, Cy- 
clopentylacetic acid, (S)-(-)-2-Pyrroiidone-5-carboxylic acid, N-Methyl-L-proline, Heptanoic acid, Ac-b-Ala-OH, 2-Ethyl- 

2- hydroxy butyric acid, 2-(2-Methoxyethoxy)acetic acid, p-Toluic acid, 6-Methylnicotinic acid, 5-Methyl-2-pyrazinecar- 
boxylic acid, 2,5-Dimethylpyrrole-3-carboxylic acid, 4-Fluorobenzoic acid, 3,5-Dimethyllsoxazole-4-carboxylic acid, 

to 3-Cyclopentyipropionic acid, Octanoic acid, N,N-Dimethylsuccinamtc acid, Phenylpropiolic acid, Cinnamic acid, 

4- Ethylbenzoic acid, p-Anisic acid, 1 ,2,5-Trimethylpyrroie-3-carboxy!ic acid, 3-Fiuoro-4-methylbenzoic acid, Ac-DL- 
Propargylglycine, 3-(Trifluoromethyl)butyricacid, 1-Piperidinepropionic acid, N-Acetylproline, 3,5-Difluorobenzoicacid, 
Ac-L-Val-OH, lndole-2-carboxylic acid, 2-Benzofurancarboxylic acid, Benzotriazola-5-carboxylic acid, 4-n-Propylben- 
zoic acid, 3-Dimethylaminobenzoic acid, 4-Ethoxybenzoic acid, 4-(Methylthio)benzoic acid, N-(2-Furoyl)glycine, 2- 

*s (Methylthio)nicotinic acid, 3-Fluoro-4-methoxybenzoic acid, Tfa-Gly-OH, 2-Napthoic acid, Quinaldic acid, Ac-L-lle-OH, 

3- Methylindene-2-carboxy!ic acid, 2-Quinoxalinecarboxylic acid, 1 -Methylindole-2-carboxylic acid, 2,3,6-Trifluoroben- 
zoic acid, N-Formyl-L~Met-OH, 2-(2-(2-Methoxyethoxy)ethoxy]acetic acid, 4-n-Butylbenzoic acid, N-Benzoylglycine, 

5- Fluoroindole-2-carboxylic acid, 4-n-Propoxybenzoic acid, 4-Acetyl-3,5-dimethyl-2-pyrrolecarboxylic acid, 
3,5-Dimethoxybenzoic acid, 2 : 6-Dimethoxynicotinic acid, Cyclohexanepentanoic acid, 2-Naphthylacetic acid, 4-(lH- 

so pyrrol-1-yl)benzoic acid, lndole-3-propionic acid, m-Trifluoromethylbenzoic acid, 5-Methoxyindole-2-carboxylic acid, 

4- Pentylbenzoic acid, Bz-b-Ala-OH, 4-Diethylaminobenzoic acid, 4-n-Butoxybenzoic acid, 3-Methyl-5-CF3nsoxazole- 
4-carboxylic acid ; (3,4-Dimethoxyphenyl)acetic acid, 4-Biphenylcarboxylic acid, Pivaloyl-Pro-OH, Octanoyl-Gly-OH, 
(2-Naphthoxy)acetic acid, lndole-3-butyric acid, 4-(Trifluoromethyl)pheny!acetic acid, 5-Methoxyindole-3-acetic acid, 
4-(Trifluoromethoxy)benzoic acid, Ac-L-Phe-OH, 4-Pentyloxybenzoic acid, Z-Gly-OH, 4-Carboxy-N-(fur-2-ylmethyl) 

55 pyrrolidin-2-one, 3,4-Diethoxybenzoic acid, 2,4-Dimethyl-5-C0 2 Et-pyrrole-3-carboxylic acid, N-(2-Fluorophenyl)succi- 
namic acid, 3,4,5-Trimethoxybenzoic acid, N-Phenylanthranilic acid, 3-Phenoxybenzoic acid, Nonanoyl-Gly-OH, 
2-Phenoxypyridine-3-carboxylic acid, 2,5-DimethyM -phenylpyrrole«3-carboxylic acid, trans-4-(Trifluoromethyl)cin- 
namic acid, (5-Methyl-2~phenyloxazol-4-yl)acetic acid, 4-(2-Cyclohexenyloxy)benzoic acid, 5-Methoxy-2-methylindole- 
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3-acetic acid, trans-4-Cotininecarboxylic acid, Bz-5-Aminovaleric acid, 4-Hexyloxybenzoic acid, N-(3-Methoxyphenyl) 
succinamic acid, Z-Sar-OH, 4-(3,4-Dimethoxyphenyl)butyric acid, Ac-o-Fluoro-DL-Phe-OH, N-(4-Fluorophenyl)gluta- 
ramic acid, 4'-Ethyl-4-biphenylcarboxylic acid, 1 ,2,3,4-Tetrahydroacridinecarboxylic acid, 3-Phenoxyphenylacetic acid, 
N-(2,4-Difluorophenyl)succinamic acid, N-Decanoyl-Gly-OH, (4-)-6-Methoxy-a-methyl-2-naphthalenaacetic acid, 3- 

5 (Trifluoromethoxy)cinnamic acid, N-Formyl-DL-Trp-OH, (R)-(+)-a-Methoxy-a-(trifluoromethyl)phenylacetic acid, Bz- 
DL-Le.u-OH, 4-(Trifluorom9thoxy)phenoxyacetic acid. 4-Heptyloxybenzoic acid, 2,3,4-Trimethoxycinnamic acid, 
2,6-Dimethoxybenzoyl-Gly-OH, 3-(3 r 4,5-Trimethoxyphenyl)propionic acid, 2,3,4,5,6-Pentafluorophenoxyacetic acid, 
N-(2,4-Difluorophenyl)glutaramic acid, N-Undecanoyl-Gly -OH, 2-(4-Fluorobanzoyl)benzoic acid, 5-Trifluoromethoxyin- 
dole-2-carboxylicacid, N-(2,4-Difluorophenyl)digiycolamicacid, Ac-L-Trp-OK Tfa-L-Phenylglycine-OH, 3-lodobenzoic 

/o acid ; 3-(4-n-Pentylbenzoyl)propionic acid, 2-Phenyl-4-quinolinecarboxylic acid, 4-Octyloxyb8nzoic acid, Bz-L-Met-OH, 
3,4,5-Triethoxybenzoicacid, N-Lauroyl-Gly-OH, 3,5-Bi5(trifluoromQthyi)benzoic acid, Ac-5-Methyl-DL-Trp-OH, 2-lodo- 
phenylacetic acid, 3-iodo-4-methylbenzoic acid, 3-(4-n-Hexyibenzoyl)propionic acid, N-Hexanoyl-L-Phe-OH, 4-Nony- 
loxybenzoic acid, 4XTnfluoromethyl)-2-biphenylcarboxylic acid, Bz-L-Phe-OH, N-Tridecanoyl-Gly-OH, 3,5-Bis(trifluor- 
omethyl)phenylacetic acid, 3-(4-n-Heptylbenzoyl)prcpionic acid, N-Hepytanoyl-L-Phe-OH, 4-DecyIoxybenzoic acid, N- 

'5 (a,a,a-trifluoro-m-tolyl)anthranilic acid, Niflumic acid, 4-(2-Hydroxyhexafluoroisopropyl)benzoic acid, N-Myristoyl-Gly- 
OH, 3-(4-n-Octylbenzoyl)propionic acid, N-Octanoyl-L-Phe-OH, 4-Undecyloxybenzoic acid, 3-(3,4,5-Trimethoxyphe- 
nyl)propionyl-Gly-OH, 8-lodonaphthoic acid, N-Pentadecanoyl-GIy-OH, 4-Dodecyloxybanzoic acid, N-Palmitoyl-Gly- 

KY, Bachem Bioscience inc., Torranca. CA; Calbiochem-Novabiochem Corp., San Diego, CA; Farchan Laboratories 
20 inc., Gainesville FL, Lancaster Synthesis, Windham NH; and MayBridgo Chemical Company (do Ryan Scientific), 
Columbia, SC The catalogs from these companies use the abreviations which are used above to identify the acids. 

f. Combinatorial Chemistry as a Means for Pro pa ring Tags 

-5 [0 124] Combinatorial chemistry is a type of synthetic strategy which leads to the production of large chemical libraries 
(see, for example, PCT Application Publication No. WO 94/08051). Those combinatorial libraries can be used as tags 
for the identification of molecules of interest (MOIs). Combinatorial cnemistry may be defined as the systematic and 
repetitive, covalent connection of a set of different "building blocks" of varying structures to each other to yield a large 
array of diverse molecular entities. Building blocks can take many forms, both naturally occurring and synthetic, such 
as nucleophiles. electrophiles, diones, alkylating or acy luting agents, diamines, nucleotides, amino acids, sugars, lipids, 
organic monomers, synthons, and combinations of the above. Chemical reactions used to connect the building blocks 
may involve alkytation, acylation, oxidation, reduction, hydrolysis, substitution, elimination, addition, cyclization, con- 
densation, and the like. This process can produce libraries of compounds which are oligomeric, ncn-oligomeric, or 
combinations thereof. If oligomeric, the compounds can be branched, unbranched, or cyclic Examples of oligomeric 
structures which can be prepared by combinatorial methods include oligopeptides, oligonucleotides, oligosaccharides, 
polylipids, polyesters, pclyamides, polyurethanes, poiyureas, polyethers, poiy(phosphorus derivatives), e.g., phos- 
phates, phosphonates, phosphoramides, phosphonamides, phosphites, phosphinamides, etc., and poly(sulfur deriv- 
atives), e.g., sulfcnes, sulfonates, sulfites, sulfonamides, suffenamides, etc. 

[0125] One common type of oligomeric combinatorial library is the peptide combinatorial library. Recent innovations 
4 o in peptide chemistry and molecular biology have enabled libraries consisting of tens to hundreds of millions of different 
peptide sequences to be prepared and used. Such libraries can be divided into three broad categories. One category 
of libraries involves the chemical synthesis of soluble nonsupport -bound peptide libraries (e.g., Houghten at al. , Nature 
354S4, 1991). A second category involves the chemical synthesis of support-bound peptide libraries, presented on 
solid supports such as plastic pins, resin beads, or cotton (Geysen et al., Mol. Immunol. 23:709, 1986; Lam et al., 
4 $ Nature 354:82, 1991; Eichlerand Houghten, Biochemistry 3211035, 1993). In these first two categories, the building 
blocks are typically L-amino acids, D-amino acids, unnatural amino acids ; or some mixture or combination thereof. A 
third category uses molecular biology approaches to prepare peptides or proteins on the surface of filamentous phage 
particles or plasmids (Scott and Craig, Curr. Opinion Biotech. 5:40, 1994). Soluble, nonsupport-bound peptide libraries 
appear to be suitable for a number of applications, including use as tags. The available repertoire of chemical diversities 
so in peptide libraries can be expanded by steps such as permethylation (Ostresh et al., Proc. Natl. Acad. Set., USA 91: 
11138, 1994). 

[0126] Numerous variants of peptide combinatorial libraries are possible in which the peptide backbone is modified, 
and/or the amide bonds have been replaced by mimetic groups. Amide mimetic groups which may be used include 
ureas, urethanes, and carbonylmethylene groups. Restructuring the backbone such that sidechains emanate from the 
55 amide nitrogens of each amino acid, rather than the alpha-carbons, gives libraries of compounds known as peptoids 
(Simon et al., Prcc. Natl. Acad Sci. r USA 35:9367, 1992). 

f0127| Another common tvne of oiioomeric combinatorial librarv is the oligonucleotide combinatorial library where 
the building blocks are some form of naturally occurring or unnatural nucleotide or polysaccharide derivatives, including 
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whore various organic and inorganic groups may substitute (or the phosphate linkage, and nitrogen or sulfur may 
substitute for oxygen in an ether linkage (Schneider et al., Biochem. 34:9599, 1995; Freier et al., J. Med Chom. 38: 
344, 1995; Frank, J. Biotechnology 41, ,1259, 1995; Schneider et a!., Published PCT WO 942052; Ecker et al, Nucleic 
Acids Res. 27:1853, 1993). 

5 [0128] More recently, the combinatorial production of collections of non-oligomeric, small molecule compounds has 
been described (DeWitt at ai. ( Proc. Natl. Acad. Sci., USA 90:690, 1993, Bunin et ai. t Proc. Natl. Acad. Sci., USA 91: 
4708, 1994). Structures suitable for elaboration into small-molecule libraries encompass a wide variety of organic 
molecules, for example heterocyclics, aromatics, alicyclics, aliphatics, steroids, antibiotics, enzyme inhibitors, ligands, 
hormones, drugs, alkaloids, opioids, terpenes, porphyrins, toxins, catalysts, as well as combinations thereof. 

10 

g. Specific Methods for Combinatorial Synthesis of Tags 

[0129] Two methods for the preparation and use of a diverse set of amine-containing MS tags are outlined below. 
In both methods, solid phase synthesis is employed to enable simultaneous parallel synthesis of a large number of 
15 tagged linkers, using the techniques of combinatorial chemistry. In the first method, the eventual cleavage of the tag 
from the oligonucleotide results in liberation of a carboxyl amide. In the second method, cleavage of the tag produces 
a cnrbcxylic acid. The chemical components and linking elements used in those methods are abbreviated as follows: 



R - 


resin 


FMOC = 


fluorenylmethoxycarbonyl protecting group 


All = 


ailyl protecting group 


CCUH = 


carboxylic acid group 


CONH 2 = 


carboxylic amide group 


NH 2 ~ " 


amino group 


Ol-f- 


hydroxy! group 


CONH = 


amide linkage 


COO = 


ester linkage 


NH 2 - Rink - C0 2 H = 


4-((a-amina)-2,4-dimethoxybenzyi]-phenoxybutync acid (Rink linker) 


OH - 1MeO - C0 2 H = 


(4-hydroxymethyl)phenoxybutyric acid 


OH-2MeO-C0 2 hT= 


(4-hydroxymethyl-3-methoxy)phenoxyacGtic acid 


NH 2 -A-COOH^ 


amino acid with aliphatic or aromatic amine functionality in side chain 


X1....Xn-COOH = 


set of n diverse carboxylic acids with unique molecular weights 


oligol ... oligo(n) = 


set of n oligonucleotides 


HBTU ^ 


O-benzotnazoM -yl-N,N,N\NMetramethyluronium hexafluorophosphate 


[0130] The sequence of steps in Method 1 is as follows: 



40 



45 



50 



55 
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OH - 2MeO - CONH - R 

I FMOC - NH - Rink - C0 2 H; couple (e.g. , HBTU) 
FMOC - NH - Rink - COO - 2MeO - CONH - R 
1 piperidine (remove FMOC) 
10 NH 2 - Rink - COO - 2MeO - CONH - R 

i FMOC - NH - A - COOH; couple (e.g. , HBTU) 
FMOC - NH - A - CONH - Rink - COO - 2MeO - CONH - R 

15 

I piperidine (remove FMOC) 

NH 2 - A - CONH - Rink - COO - 2MeO - CONH - R 

20 1 divide into n aliquots 

iiiii couple to n different acids XI .... Xn - COOH 

XI Xn - CONH - A - CONH - Rink - COO- 2MeO - CONH - R 

25 iiiil Cleave tagged linkers from resin with 1% TFA 

XI Xn - CONH - A -CONH - Rink - C0 2 H 



30 



35 



40 



45 



lilil couple to n oligos (oligol oligo(n)) 

(e.g., via Pfp esters) 

XI Xn - CONH - A - CONH - Rink - CONH - oligol oligo(n) 

i pool tagged oligos 

i perform sequencing reaction 

I separate different length fragments from 

sequencing reaction (e.g. , via HPLC or CE) 
i cleave tags from linkers with 25%- 100% TFA 

XI Xn - CONH - A - CONH 

1 

analyze by mass spectrometry 
[0131] The sequence of steps in Method 2 is as follows: 



so 



55 
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OH - IMcO-COj - AJ] 

I FMOC - NH - A - C0 2 H; couple (e.g., HBTU) 

FMOC - NH - A - COO - lMeO-C0 2 -All 

4- Palladium (remove Ally!) 

FMOC - NH - A - COO - lMeO - C0 2 H 

i OH - 2MeO - CONH - R; couple (e.g. , HBTU) 

FMOC - NH - A - COO - lMeO - COO - 2MeO - CONH - R 

•1 piperidine (remove FMOC) 

NH 2 - A - COO - IMeO - COO - 2MeO - CONH - R 

4- divide into n aliquots 
444444 couple to n different acids XI Xn - CO,H 

XI Xn - CONH - A - COO - IMeO - COO - 2MeO - CONH - R 

44144 cleave tagged linkers from resin with 1% TFA 

X 1 Xn - CONH - A - COO - 1 MeO - C0 2 H 

44444- couple to n oiigos (oligol oligo(n)) 

(e.g., via Pfp esters) 

XI Xn- CONH -A- COO - 1 MeO - CONH - oligol oligo(n) 

I pool tagged oiigos 

I perform sequencing reaction 

4- separate different length fragments from 

sequencing reaction (e.g. , via HPLC or CE) 
i cleave tags from linkers with 25- 1 00% TFA 

XI Xn - CONH - A - C0 2 H 

i 

analyze by mass spectrometry 

2. Linkers 

[0132] A "linker" component (orL), as used herein, means either a direct covalent bond or an organic chemical group 
which is used to connect a "tag" (orT) to a "molecule of interest" (or MOI) through covalent chemical bonds. In addition, 
the direct bond itself, or one or more bonds within the linker component is cleavable under conditions which allows T 
to be released (in other words, cleaved) from the remainder of the T-L-X compound (including the MOI component). 
The tag variable component which is present within T should be stable to the cleavage conditions. Preferably, the 
cleavage can be accomplished rapidly; within a few minutes and preferably within about 15 seconds or less. 
[0133] In general, a linker is used to connect each of a large set of tags to each of a similarly large set of MOI s. 
Typically, a single tag-linker combination is attached to each MOI (to give various T-L-MOI), but in some cases, more 
than one tag-linker combination may be attached to each individual MOI (to give various (T-L)n-MOI). In another em- 
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bodiment of the present invention, two or more tags are bonded to a single linker through multiple, independent sites 
on the linker, and this multiple tag-linker combination is then bonded to an individual MOI (to give various (T)n-L-MOI). 
[0134] After various manipulations of the set of tagged MOIs, special chemical and/or physical conditions are used 
to cleave one or more covalent bonds in the linker, resulting in the liberation of the tags from the MOIs. The cleavable 
bond(s) may or may not be some of the same bonds that were formed when the tag, linker, and MOI were connected 
together. The design of the linker will, in large part, determine the conditions under which cleavage may be accom- 
plished. Accordingly, linkers may be identified by the cleavage conditions they are particularly susceptible too. When 
a linker is photolabile [i.e., prone to cleavage by exposure to actinic radiation), the linker may be given the designation 
L h ^. Likewise, the designations L acid , L base , U°l U R I, L enz , L elc , L A and L 33 may be used to refer to linkers that are 
particularly susceptible to cleavage by acid, base, chemical oxidation, chemical reduction, the catalytic activity of an 
enzyme (more simply "enzyme"), electrochemical oxidation or reduction, elevated temperature ("thermal") and thiol 
exchange, respectively. 

[0135] Certain types of linker are labile to a single type of cleavage condition, whereas others are labile to several 
types of cleavage conditions. In addition, in linkers which are capable of bonding multiple tags (to give (T)n-L-MOI type 
structures), each of the tag-bonding sites may be labile to different cleavage conditions. For example., in a linker having 
two tags bonded to it, one of the tags may be labile only to base, and the other labile only to photolysis. 
[0136] A linker which is useful in the present invention possesses several attributes: 

1 ) The linker possesses a chemical handle (L h ) through which it can be attached to an MOI. 

2) The linker possesses a second, separate chemical handle (L^) through which the tag is attached to the linker. 
If multiple tags are attached to a single linker ((T)n-L-MOI type structures), then a separate handle exists for each 
tag. 

3) The linker is stable toward all manipulations to which it is subjected, with the exception of the conditions which 
allow cleavage such that a T-containing moiety is released from the remainder of the compound, including the 
MOI Thus, the linker is stable during attachment of the tag to the linker, attachment of the linker to the MOI, and 
any manipulations of the MOI while the tag and linker (T-L) are attached to it. 

4) The linker does not significantly interfere with the manipulations performed on the MOI while the T-L is attached 
to it. For instance, if the T-L is attached to an oligonucleotide, the T-L must not significantly interfere with any 
hybridization or enzymatic reactions (e.g., PCR) performed on the oligonucleotide Similarly, if the T-L is attached 

to an antibody, it must not significantly interfere with antigen recognition cy the antibody. 

5) Cleavage of the tag from the remainder of the compound occurs in a highly controlled manner, using physical 
or chemical processes that do not adversely affect the detectability of the tag. 

[0137] For any given linker, it is preferred that the linker be attachable to a wide variety of MOIs, and that a wide 
variety ot tags be attachable to the linker. Such flexibility is advantageous because it ailows a library of T-L conjugates, 
once prepared, to be used with several different sets of MOIs. 
[0138] As explained above, a preferred linker has the formula 

L h -L 1 -L 2 -L 3 -L h 

wherein each L h is a reactive handle that can be used to link the linker to a tag reactant and a molecule of interest 
reactant. L 2 is an essential part of the linker because L 2 imparts lability to the linker. L 1 and L 3 are optional groups 
which effectively serve to separate L 2 from the handles L h . 

[0139] L 1 (which, by definition, is nearer to T than is L 3 ), serves to separate T from the required labile moiety L 2 . This 
separation may be useful when the cleavage reaction generates particularly reactive species (e.g., free radicals) which 
may cause random changes in the structure of the T-containing moiety. As the cleavage site is further separated from 
the T-containing moiety, there is a reduced likelihood that reactive species formed at the cleavage site will disrupt the 
structure of the T-containing moiety. Also, as the atoms in L1 will typically be present in the T-containing moiety, these 
L 1 atoms may impart a desirable quality to the T-containing moiety. For example, where the T-containing moiety is a 
T ms -containing moiety, and a hindered amine is desirably present as part of the structure of the T ms -containing moiety 
(to serve, e.g., as a MSSE), the hindered amine may be present in L 1 labile moiety. 

[0140] In other instances, L 1 and/or L 3 may be present in a linker component merely because the commercial supplier 
of a linker chooses to sell the linker in a form having such a L 1 and/or L 3 group. In such an instance, there is no harm 
m using linkers having V and/or L 3 groups, (so long as these group do not inhibit the cleavage reaction) even though 
they may not contribute any particular performance advantage to the compounds that incorporate them. Thus, the 
present invention allows for L 1 and/or L 3 groups to be present in the linker component. 

[0141] L 1 and/or L 3 groups may be a direct bond (in which case the group is effectively not present), a hydrocarbylene 
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group (e.g., alkylene, arylene, cycioalkylene, etc.), -O-hydrocarbylene (e.g., -0-CH 2 -, Q-CH 2 CH(CH 3 )-, etc.) orhydro- 
carbylene-(0-hydrocarbylene) w - wherein w is an integer ranging from 1 to about 10 (e.g., -CH 2 -0-Ar-, -CH 2 -(0- 
CH 2 CH 2 ) 4 -, etc.). 

[0142] With the advent of solid phase synthesis, a great body of literature has developed regarding linkers that are 
labile to specific reaction conditions. In typical solid phase synthesis, a solid support is bonded through a labile linker 
to a reactive site, and a molecule to be synthesized is generated at the reactive site. When the molecule has been 
completely synthesized, the solid support-linker-molecule construct is subjected to cleavage conditions which releases 
the molecule from the solid support. The labile linkers which have been dovelopod for use in this context (or which 
may bo used in this context) may also be readily used as the linker reactant in the present invention. 
[0143] Lloyd-Williams, P., et al., "Convergent Solid-Phase Peptide Synthesis", Tetrahedron Report No. 347, 49(48): 
11065-11133 (1993) provides an extensive discussion of linkers which are labile to actinic radiation (i.e., photolysis), 
as well as acid, base and other cleavage conditions. Additional sources of information about labile linkers may be 
readily obtained. 

[0144] As described above, different linker designs will confer cleavability ("lability") under different specific physical 
or chemical conditions. Examples of conditions which serve to cleave various designs of linker include acid, base, 
oxidation, reduction, fluoride, thiol exchange, photolysis, and enzymatic conditions. 

[0145] Examples of cleavabie linkers that satisfy the general criteria for linkers listed above will be well known to 
these m iho art and include muse round in the catalog available from nerco ^noeMcro, iL). examples include; 

• ethylene glycobis(succmimidylsuccinate) (EGS), an amine reactive cross-linking reagent which is cleavabie by 
hydroxylamine (1 M at 37°C for 3-6 hours); 

• disuccinimidyl tartarate (DST) and sulfc-DST, which are amine reactive uross-linking reagents, cleavabie by 0.01 5 
M sodium periodate; 

bis[2-(succinimidyloxycarbonyioxy)ethyl]sulfone (BSOCOES) and sulfo-BSOCOES, which arc amine reactive 
cross-linking reagents, cleavabie by base (pH 11.6); 

• 1 ,4-di-[3 , -(2'-pyridyldithio(propionamido))butane (DPDPB), a pyridyldithiol crosslinker which is cleavabie by thiol 
exchange or reduction. 

N-[4"(p-azidcsalicylarn!do}^buty!]-3 , -;2 , -pyridydithio)propionamide (APDP), a pyridyldithiol crosslinker which is 
cleavabie by thiol exchange or reduction: 

bis-;bota-4-(az!dC3a!icyianidc)ethyl|-di3ulf'de. a phctcreactive crosslinker which is cleavabie by thiol exchange or 
reduction; 

• N-succinimidyl-(4-azidophenyl)-1 ,3'dithiopropionate (SADP), a photoreactive crosslinker which is cleavabie by thi- 
ol exchange or reduction; 

• 3u!fosuccinimidyl-2-(7-azido-4-methylcoumarin-3-acetamide)ethyl-1,3 , -dithiopropionato (SAED), a photoreactive 
crosslinker which is cleavabie by thiol exchange or reduction; 

• sulfosuccinimidyl-2-(m-azido-o-nitrobenzamido)-ethyi-1,3 , dithiopropicnate (SAND), a photoreactive crosslinker 
which is cleavabie by thiol exchange or reduction. 

[0146] Other examples of cleavabie linkers and the cleavage conditions that can be used to release tags are as 
follows. A siiyl linking group can be cieaved by fluoride or under acidic conditions. A 3-, 4-, 5-, or 6-substituted-2-ni- 
trobenzyloxy or 2-, 3- : 5-, or 6-substituted-4-nitrobenzyloxy linking group can be cleaved by a photon source (photol- 
ysis). A 3-, 4-, 5-, or 6-substituted-2-alkoxyphenoxy or 2-, 3-, 5-, or 6-substi!uted-4-alkoxyphenoxy linking group can 
bo cleaved by Ce(NH 4 ) 2 (N0 3 ) 6 (oxidation). A NC0 2 (urethane) linker can be cleaved by hydroxide (base), acid, or 
LiAIH 4 (reduction). A 3-pentenyl, 2-butenyl, or 1-butenyl linking group can be cleaved by 0 3 , 0 S 0 4 /I0 4 ", or KMn0 4 
(oxidation). A 2-[3-, 4-, or 5-substituted-furyl]oxy linking group can be cleaved by 0 2 , Br 2 , MeOH, or acid. 
[0147] Conditions for the cleavage of other labile linking groups include: t-alkyloxy linking groups can be cleaved by 
acid; methyl(dialkyl)methoxy or 4-substituted-2-alky!-1,3-dioxlane-2-yl linking groups can be cleaved by H 3 0; 2-si- 
lylethoxy linking groups can be cleaved by fluoride or acid; 2-(X)-ethoxy (where X = keto, ester amide, cyano, N0 2 , 
sulfide, sulfoxide, sulfone) linking groups can be cieaved under alkaline conditions; 2-, 3-, 4-, 5-, or 6-substituted- 
benzyloxy linking groups can be cleaved by acid or under reductive conditions; 2-butenyloxy linking groups can be 
cleaved by (Ph 3 P) 3 RhCI(H), 3-, 4-, 5-, or 6-substituted-2-bromophenoxy linking groups can be cleaved by Li, Mg, or 
BuLi; methylthiomethoxy linking groups can be cleaved by Hg 2+ ; 2-(X)-ethy!oxy (where X = a halogen) linking groups 
can be cleaved by Zn or Mg; 2-hydroxyethyloxy linking groups can be cleaved by oxidation (e.g., with Pb(OAc) 4 ), 
[0148] Preferred linkers are those that are cleaved by acid or photolysis. Several of the acid-labile linkers that have 
been developed for solid phase peptide synthesis are useful for linking tags to MOIs. Some of these linkers are de- 
scribed in a recent review by Lloyd-Williams et al. (Tetrahedron 49:11065-11133, 1993). One useful type of linker is 
based upon p-alkoxybenzyl alcohols, of which two, 4-hydroxymethylphenoxyacetic acid and 4-(4-hydroxymethyl- 
3-methoxyphenoxy)butyric acid, are commercially available from Advanced ChemTech (Louisville, KY). Both linkers 
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can be attached to a tag via an ester linkage to the benzylalcohol, and to an amine-containing MO I via an amide linkage 
to the carboxylic acid. Tags linked by these molecules are released from the MOI with varying concentrations of trif- 
luoroacetic acid. The cleavage of these linkers results in the liberation of a carboxylic acid on the tag. Acid cleavage 
of tags attached through related linkers, such as 2.4-dimethoxy-4'-(carboxymethyloxy)-benzhydrylamine (available 

5 from Advanced ChemTech in FMOC-protected form), results in liberation of a carboxylic amide on the released tag. 
[0149] The photolabile linkers useful for this application have also been for the most part developed for solid phase 
peptide synthesis (see Lloyd-Williams review). These linkers are usually based on 2-nitrobenzylesters or 2-nitroben- 
zylamides. Two examples of photolabile linkers that have recently been reported in the literature are 4-(4-(1-Fmoc- 
amino)ethyl)-2-methoxy-5-nitrophenoxy)butanoic acid (Holmes and Jones, J. Org. Chem. 60.-2318-2319, 1995) and 

i0 3-(Fmoc-amino)-3-(2-nitrophenyl)propionic acid (Brown et aL, Molecular Diversity 7:4-12, 1995). Both linkers can be 
attached via the carboxylic acid to an amine on the MOI. The attachment of the tag to the linker is made by forming 
an amide between a carboxylic acid on the tag and the amine on the linker. Cleavage of photolabile linkers is usually 
performed with UV light of 350 nm wavelength at intensities and times known to those in the art. Cleavage of the linkers 
results in liberation of a primary amide on the tag. Examples of photocleavable linkers include nitrophenyl glycine 

is esters, exo- and endo-2-benzonorbomeyl chlorides and methane sulfonates, and 3-amino-3(2-nitrophenyl) propionic 
acid. Examples of enzymatic cleavage include esterases which will cleave ester bonds, nucleases which will cleave 
phosphodiester bonds, proteases which cleave peptide bonds, etc. 

L<J I oOj ;-\ pi Li I i .if '.Au'i v>oi t ipUIiLil 1 L i Uo Jl J sJi if iu i iiii uuui i*_y , iJu.ui o ,x.-> o< hj ,Vi t i.uiU «v 

20 



d 



25 




N0 2 
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wherein one carbon atom at positions a, b, c, d or e is substituted with -L 3 -X, and U (which is preferably a direct bond) 
is present to the left of N(R 1 ) in the above structure. Such a linker component is susceptible to selective photo-induced 
cleavage of the bond between the carbon labeled "a" and N(R j ). The identity of R 1 is not typically critical to the cleavage 
reaction, however R 1 is preferably selected from hydrogen and hydrocarbyl. The present invention provides that in the 
os above structure. -N(R ] )- could be replaced with -O-. Also in the above structure, one or more of positions b t c, d or e 
may optionally be substituted with alkyl,.aikoxy, fluoride, chloride, hydroxy I, carboxylate or amide, where these sub- 
stituents are independently selected at each occurrence. 

[0151] A further preferred linker component with a chemical handle L h has the following structure: 



45 




SO 

wherein one or more of positions b. c, d or e is substituted with hydrogen, alkyl. alkoxy, fluoride, chloride, hydroxyl, 
carboxylate or amide, R 1 is hydrogen or hydrocarbyl, and R 2 is -OH or a group that either protects or activates a 
carboxylic acid for coupling with another moiety. Fluorocarbon and hydrofluorocarbon groups are preferred groups that 
activate a carboxylic acid toward coupling with another moiety. 

55 

3. Molecule of Interest (MOD 



[0152] Examples of MOIs include nucleic acids or nucleic acid analogues {e.g., PNA), fragments of nucleic acids (/. 
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e., nucleic acid fragments), synthetic nucleic acids or fragments, oligonucleotides (e.g., DNA or RNA), proteins, pep- 
tides, antibodies or antibody fragments, receptors, receptor ligands, members of a ligand pair, cytokines, hormones, 
oligosaccharides, synthetic organic molecules, drugs, and combinations thereof. 

[0153] Preferred MOIs include nucleic acid fragments. Preferred nucleic acid fragments are primer sequences that 
are complementary to sequences present in vectors, where the vectors are used for base sequencing. Preferably a 
nucleic acid fragment is attached directly or indirectly to a tag at other than the 3' end of the fragment; and most 
preferably at the 5* end of the fragment. Nucleic acid fragments may be purchased or prepared based upon genetic 
databases {e.g., Dib et aL Nature 330:152-154, 1996 and CEPH Genotype Database, http://www.cephb.fr) and com- 
mercial vendors (e.g., Promega, Madison, Wl). 

[0154] As used herein, MOI includes derivatives of an MOI that contain functionality useful in joining the MOI to a T- 
L-l h compound. For example, a nucleic acid fragment that has a phosphodiester at the 5' end, where the phosphodiester 
is also bonded to an alkyleneamine, is an MOI. Such an MOI is described in, e.g., U.S. Patent 4,762,779 which is 
incorporated herein by reference. A nucleic acid fragment with an internal modification is also an MOI. An exemplary 
internal modification of a nucleic acid fragment is where the base (e.g., adenine, guanine, cytosine, thymidine, uracil) 
has been modified to add a reactive functional group. Such internally modified nucleic acid fragments are commercially 
available from, e.g., Glen Research, Herndon, VA. Another exemplary internal modification of a nucleic acid fragment 
is where an abasic phosphoramidate is used to synthesize a modified phosphodiester which is interposed between a 
sugar and phosphate group of a nucleic acid fragment. The abasic phosphoramidate contains a reactive group which 
allows a nucleic acid fragment that contains this phosphoramidate-derived moiety to be joined to another moiety, e.g., 
a T-l-Lj, compound. Such abasic phosphoramidates are commercially available from, e.g., CJonetech Laboratories, 
Inc., Palo Alto, CA. 

4. Chemical Handles (L h ) 

[0155] A chemical handle is a stable yet reactive atomic arrangement present as part of a first molecule, where the 
handle can undergo chemical reaction with a complementary chemical handle present as part of a second molecule, 
so as to form a covalent bond between the two molecules. For example, the chemical handle may be a hydroxy! group, 
and the complementary chemical handle may bo a carboxylic acid group (or an activated derivative thereof, e.g., a 
hydrofluroaryl ester), whereupon reaction between these two handles forms a covalent bond (specifically, an ester 
group) that joins the two molecules together. 

[0156] Chemical handies may be used in a large number of covalent bond-forming reactions that are suitable for 
attaching tags to linkers, and linkers to MOIs. Such reactions include alkylation (e.g., to form ethers, thioethers), acyla- 
tion (e.g., to form esters, amides, carbamates, ureas, thioureas), phosphorylation {e.g., to form phosphates, phospho- 
nates, phosphoramides, phosphonamides), sulfonylation (e.g., to form sulfonates, sulfonamides), condensation (e.g., 
to form imines : oximes, hydrazcnes). silylation, disulfide formation, and generation of reactive intermediates, such as 
nitrenes or carbenes, by photolysis. In general, handles and bond-forming reactions which are suitable for attaching 
tags to linkers are also suitable lor attaching linkers to MOIs, and vice-versa, in some cases, the MOI may undergo 
prior modification or derivitizaticn to provide the handle needed for attaching the linker. 

[0157] One type of bond especially useful for attaching linkers to MOIs is the disulfide bond. Its formation requires 
the presence of a thiol group ("handle") on the linker, and another thiol group on the MO!. Mild oxidizing conditions 
then suffice to bond the two thiols together as a disulfide. Disulfide formation can also be induced by using an excess 
of an appropriate disulfide exchange reagent, e.g., pyridyl disulfides. Because disulfide formation is readily reversible, 
the disulfide may also be used as the cleavable bond for liberating the tag, if desired. This is typically accomplished 
under similarly mild conditions, using an excess of an appropriate thiol exchange reagent, e.g., dithiothreitol. 
[0158] Of particular interest for linking tags (or tags with linkers) to oligonucleotides is the formation of amide bonds. 
Primary aliphatic amine handles can be readily introduced onto synthetic oligonucleotides with phosphoramidites such 
as 6-monomethoxytn'tylhexylcyanoethyl-N,N-diisopropyl phosphoramidite (available from Glenn Research, Sterling, 
VA). The amines found on natural nucleotides such as adenosine and guanosine are virtually unreactive when com- 
pared to the introduced primary amine. This difference in reactivity forms the basis of the ability to selectively form 
amides and related bonding groups (e.g., ureas, thioureas, sulfonamides) with the introduced primary amine, and not 
the nucleotide amines. 

[0159] As listed in the Molecular Probes catalog (Eugene, OR), a partial enumeration of amine-reactive functional 
groups includes activated carboxylic esters, isocyanates, isothiocyanates. sulfonyl halides, and dichlcrotriazenes. Ac- 
tive esters are excellent reagents for amine modification since the amide products formed are very stable. Also, these 
reagents have good reactivity with aliphatic amines and low reactivity with the nucleotide amines of oligonucleotides. 
Examples of active esters include N-hydroxysuccmimide esters, pentaflucrophenyl esters, tetrafluorcphenyl esters, 
and p-nitrophenyl esters. Active esters are useful because they can be made from virtually any molecule that contains 
a carboxylic acid. Methods to make active esters are listed in Bodansky (Principles of Peptide Chemistry (2d ed.), 
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Springer Verlag, London, 1993), 
5. Linker Attachment 

[0160] Typically, a single type of linker is used to connect a particular set or family of tags to a particular set or family 
of MOIs. In a preferred embodiment of the invention, a single, uniform procedure may be followed to create all the 
various T-L-MOI structures. This is especially advantageous when the set of T-L-MOI structures is large, because it 
allows the set to be prepared using the methods of combinatorial chemistry or other parallel processing technology. 
In a similar manner, the use of a single, type of linker allows a single, uniform procedure to be employed for cleaving 
all the various T-L-MOI structures. Again, this is advantageous for a large set of T-L-MOI structures, because the set 
may be processed in a parallel, repetitive, and/or automated manner. 

[0161] There are, however, other embodiment of the present invention, wherein two or more types of linker are used 
to connect different subsets of tags to corresponding subsets of MOIs. In this case, selective cleavage conditions may 
be used to cleave each of the linkers independently, without cleaving the linkers present on other subsets of MOIs. 
[0162] A large number of covalent bond-forming reactions are suitable for attaching tags to linkers, and linkers to 
MOIs. Such reactions include aikylation (e.g., to form ethers, thioethers), acylation (e.g., to form esters, amides, car- 
bamates, ureas, thioureas), phosphorylation (e.g., to form phosphates, phosphonates, phosphoramides, phosphona- 
mides), sulfonylation {e.g., to form sulfonates, sulfonamides), condensation (e.g., to form imines, oximes, hydrazones), 
silylation, disulfide formation, and generation of reactive intermediates, such as nitrenes or carbenos, by photolysis. 
In general, handles and bond-forming reactions which are suitable for attaching tags to linkers are also suitable for 
attaching linkers to MOIs, and vice-versa. In some cases, the MOI may undergo prior modification or denvttization to 
provide the handle needed for attaching the linker. 

[0163] One type of bond especially useful for attaching linkers to MOIs is the disulfide bond. Its formation requires 
the presence of a thiol group ("handle") on the linker, and another thiol group on the MOI. Mild oxidizing conditions 
then suffice to bond the two thiols together as a disulfide. Disulfide formation can also be induced by using an excess 
of an appropriate disulfide exchange reagent, e.g., pyridyl disulfides. Because disulfide formation is readily reversible, 
the disulfide may also be used as the cleavable bond (or liberating the tag, if desired. This is typically accomplished 
under similarly mild conditions, using an excess of an appropriate thiol exchange reagent, e.g., dithiothreitol. 
[0164] Of particular interest for linking tags to oligonucleotides is the formation of amide bonds. Primary aliphatic 
amine handles can be readily introduced onto synthetic oligonucleotides with phosphoramidites such as 6-rnonometh- 
oxytritylhexylcyanoethyl-N,N-diisopropyi phosphoramidite (available from Glenn Research, Sterling, VA). The amines 
found on natural nucleotides such as adenosine and guanosine are virtually unreactive when compared to the intro- 
duced primary amine. This difference in reactivity forms the basis of the ability to selectively form amides and related 
bonding groups (e.g., ureas, thioureas, sulfonamides) with the introduced primary amine, and not the nucleotide 
aminos 

[0165] As listed in the Molecular Probes catalog (Eugene, OR), a partial enumeration of amine-reactive functional 
groups includes activated carboxylic esters, isocyanates, isothiocyanates, sulfonyl halides, and dichlorotriazenes. Ac- 
tive esters are excellent reagents for amine modification since the amide products formed are very stable. Also, these 
reagents have good reactivity with aliphatic amines and low reactivity with the nucleotide amines of oligonucleotides. 
Examples of active asters include N-hydroxysuccinimide esters, pentaflucrophenyl esters, tetrafluorophenyl esters, 
and p-nitrophenyl esters. Active esters are useful because they can be made from virtually any molecule that contains 
a carboxylic acid. Methods to make active esters are listed in Bodansky {Principles of Peptide Chemistry (2d ed.), 
Springer Verlag, London, 1993). 

[0166] Numerous commercial cross-linking reagents exist which can serve as linkers {e.g., see Pierce Cross-linkers, 
Pierce Chemical Co., Rockford, IL). Among these are homobifunctional amine-reactive cross-linking reagents which 
are exemplified by homobifunctional imidoesters and N-hydroxysuccinimidy! (NHS) esters. There also exist heterobi- 
functional cross-linking reagents possess two or more different reactive groups that allows for sequential reactions. 
Imidoesters react rapidly with amines at alkaline pH. NHS-esters give stable products when reacted with primary or 
secondary amines. Maleimides, alkyl and aryl halides, alpha-haloacyls and pyridyl disulfides are thiol reactive. Male- 
imides are specific for thiol (sulfhydryl) groups in the pH range of 6.5 to 7.5, and at alkaline pH can become amine 
reactive. The thioether linkage is stable under physiological conditions. Alpha-haloacetyl cross-linking reagents contain 
the iodoacetyl group and are reactive towards sulfhydryis. Imidazoles can react with the iodoacetyl moiety, but the 
reaction is very slow. Pyridyl disulfides react with thiol groups to form a disulfide bond. Ca rood iim ides couple carboxyls 
to primary amines ol hydrazides which give rises to the formation of an acyl-hydrazine bond. The aryiazides are pho- 
toaffinity reagents which are chemically inert until exposed to UVor visible light. When such compounds are photolyzed 
at 250-460 nm, a reactive aryl nitrene is formed. The reactive aryl nitrene is relatively non-specific. Glyoxals are reactive 
towards guanidinyi portion of arginine. 

[0167] In one typical embodiment of the present invention, a tag is first bonded to a linker, then the combination of 
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tag and linker is bonded to a MOI, to create the structure T-L-MOI. Alternatively, the same structure is formed by first 
bonding a linker to a MOI , and then bonding the combination of linker and MOI to a tag. An example is where the MOI 
is a DNA primer or oligonucleotide. In that case, the tag is typically first bonded to a linker, then the T-L is bonded to 
a DNA primer or oligonucleotide, which is then used, for example, in a sequencing reaction. 

[0168] One useful form in which a tag could be reversibly attached to an MOI (e.g., an oligonucleotide or DNA se- 
quencing primer) is through a chemically labile linker. One preferred design for the linker allows the linker to be cleaved 
when exposed to a volatile organic acid, for example, trifluoroacetic acid (TFA). TFA in particular is compatible with 
most methods of MS ionization, including electrospray. 

[01 69] As described in detail below, the invention provides methodology for genotyping. A composition which is useful 
in the genotyping method comprises a purality of compounds of the formula: 

T m3 -L-MOI 

wherein, 

T m3 is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a T m3 -containing moiety to be cleaved from the remainder of the compound, wherein the 
T m3 -con taming moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quaternary amine and organic and. In the 
formula, MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3* end of the 
MOI. In the composition, at least two compounds have the same T™ 3 but the MOI groups of those molecules have 
non-identical nucleotide lengths. 

[0170] Another composition that is useful in the genotyping method comprises a plurality of compounds of the formula: 



wherein T™ 5 is an organic group detectable by mass spectrometry, comprising carbcn, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, pnosphorus and iodine. In the formula, L is an 
organic group which allows a T m3 -containing moiety to be cleaved from the remainder of the compound, wherein the 
T ms -containing moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quaternany amine and organic acid. In the 
formula, MOI is a nucleic acid fragment wherein L is conjugated to the MCI at a location other than the 3" end of the 
MOI In the composition, at least two compounds have the same T m3 but those compounds have non-identical elution 
times by column chromatography. 

[0171] Another composition that may be used in the genotyping method comprises a plurality of compounds of the 
formula: 

T ms -L-MOI 

wherein T™ 3 is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a T ms -containing moiety to be cleaved from the remainder of the compound, wherein the 
T ms -containing moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quaternary amine and organic acid, in the 
formula, MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3' end of the 
MOI. In the composition, no two compounds which have the same MOI nucleotide length also have the same T ms . 
f0172] In the above composition, the plurality is preferably greater than 2, and preferably greater than 4. Also, the 
nucleic acid fragment in the MOI have a sequence complementary to a portion of a vector, wherein the fragment is 
capable of priming polynucleotide synthesis. Preferably the T ms groups of members of the plurality differ by at least 2 
amu, and may differ by at least 4 amu. 

[0173] The invention also provides for a composition comprising a plurality of sets of compounds, each set of com- 
pounds having the formula: 



T -L-MOI 
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wherein T ma is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a T m3 -containing moiety to be cleaved from the remainder of the compound, wherein the 
T m3 -containing moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quaternary amine and organic acid. Also, in the 
formula, MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3' end of the 
MOI. In the composition, members within a first set of compounds have identical Tms groups, however have non- 
identical MOI groups with differing numbers of nucleotides in the MOI and there are at least ten members within the 
first set, wherein between sets, the T ms groups differ by at least 2 amu. The plurality is preferably at least 3, and more 
preferably at least 5. 

[0174] The invention also provides for a composition comprising a plurality of sets of compounds, each set of com- 
pounds having the formula 



wherein, T ms is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a T m3 -containmg moiety to be cleaved from the remainder of the compound, wherein the 
T^-containing moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quaternary amine and organic acid. In the 
formula, MOI is a nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3' end of the 
MO! In the composition, the compounds within a set have the same elution time but non-identical T ms groups. 
[0175] In addition, the invention provides a kit for genotyping. The kit comprises a plurality of amplification primer 
pairs, wherein at least one of the primers has the formula; 

T mc -L-MOI 

wherein T ma is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygon, nitrogen, sulfur, phosphorus and iodine. In the formula, L is an 
organic group which allows a T m3 -containing moiety (o be cleaved from the remainder of the compound, wherein the 
T m3 -containing moiety comprises a functional group which supports a single ionized charge state when the compound 
is subjected to mass spectrometry and is selected from tertiary amine, quaternary amine and organic acid. In the 
formula. MO! is r nucleic acid fragment wherein L is conjugated to the MOI at a location other than the 3' end of the 
MOf; and each primer pair associates with a different loci In the kit, the pluality is preferably at least 3, and more 
preferably at least 5. 

[0176] As noted above, the present invention provides compositions and methods for determining the sequence of 
nucleic acid molecules. Briefly, such methods generally comprise the steps of (a) generating tagged nucleic acid frag- 
ments which are complementary to a selected nucleic acid molecule (e.g., tagged fragments) from a first terminus to 
a second terminus of a nucleic acid molecule), wherein a tag is correlative with a particular or selected nucleotide, and 
may be detected by any of a variety of methods, (b) separating the tagged fragments by sequential length, (c) cleaving 
a tag from a tagged fragment, and (d) detecting the tags, and thereby determining the sequence of the nucleic acid 
molecule. Each of the aspects will be discussed in more detail below. 

B. DIAGNOSTIC METHODS 

1 . Introduction 

[0177] As noted above, the present invention also provides a wide variety of methods wherein the above-described 
tags and/or linkers may be utilized in place of traditional labels (e.g., radioactive or enzymatic), in order to enhance 
the specificity, sensitivity, or number of samples that may be simultaneously analyzed, within a given method. Repre- 
sentative examples of such methods which may be enhanced include, for example, RNA amplification (see Lizardi et 
al., Bio/Technology 6:1197-1202, 1988; Kramer etal., Na ture 339:401 -402, 1989; LomeJi et al., Clinical Chem. 35(9): 
1826-1831, 1989; U.S. Patent No. 4,786,600), and DNA amplification utilizing LCR or polymerase chain reaction 
("PCR") (see, U.S. Patent Nos. 4,683,195, 4,683.202, and 4800,159). 

[01 78] Within one aspect of the present invention, methods are provided for determining the identity of a nucleic acid 
molecule or fragment (or for detecting the presence of a sefected nucleic acid molecule or fragment), comprising the 
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steps of (a) generating tagged nucleic acid molecules from one or more selected target nucleic acid molecules, wherein 
a tag is correlative with a particular nucleic acid molecule and detectable by non-fluorescent spectrometry or potenti- 
ometry, (b) separating the tagged molecules by size, (c) cleaving the tags from the tagged molecules, and (d) detecting 
the tags by non-fluorescent spectrometry or potentiometry, and therefrom determining the identity of the nucleic acid 
molecules. 

[0179] Within a related aspect of the invention, methods are provided for detecting a selected nucleic acid molecule, 
comprising the steps of (a) combining tagged nucleic acid probes with target nucleic acid molecules under conditions 
and for a time sufficient to permit hybridization of a tagged nucleic acid probe to a complementary selected target 
nucleic acid sequence, wherein a tagged nucleic acid probe is detectable by non-fluroescent spectrometry or potenti- 
ometry, (b) altering the size of hybridized tagged probes, unhybridized probes or target molecules, or the probelarget 
hybrids, (c) separating the tagged probes by size, (d) cleaving tags from the tagged probes, and (e) detecting tags by 
non-fluorescent spectrometry or potentiometry, and therefrom detecting the selected nucleic acid molecule. These, 
other related techniques are discussed in more detail below. 

2. PGR 

[0180] PCR can amplify a desired DNA sequence of any origin (virus, bacteria, plant, or human) hundreds of millions 
of times in a matter of hours. PCR is especially valuable because the reaction is highly specific, easily automated, and 
capable of amplifying minute amounts of sample. For these reasons, PCR has had a major impact on clinical medicine, 
genetic disease diagnostics, forensic science and evolutionary biology. 

[0181] Briefly, PCR is a process based on a specialized polymerase, which can synthesize a complementary strand 
to a giv^n DNA strand in a mixture containing the 4 DNA bases and 2 DNA fragments (primers, each about 20 bases 
long) flanking the target sequence. The mixture is heated to separate the strands of double- stranded DNA containing 
the target sequence and then cooled to allow (1 ) the primers to find and bind to their complementary sequences on 
the separated strands and (2) the polymerase to extend the primers into new complementary strands. Repeated heating 
and cooling cycles multiply the target DNA exponentially, since each new double strand separates to become two 
templates for further synthesis. In about 1 hour, 20 PCR cycles can amplify the target by a millionlold. 
[0182] Within one embodiment of the invention, methods are provided (or determining the identity of a nucleic acid 
molecule, or for detecting the selected nucleic acid molecule in, for example, a biological sample, utilizing the technique 
ol PCR. Briefly, such methods comprise the steps of generating a series of tagged nucleic acid fragments or molecules 
during the PCR and separating the resulting fragments are by size. The size separation step can be accomplished 
utilizing any of the techniques described herein, including for example gel electrophoresis {e.g., poly aery lam ide gel 
electrophoresis) or preferably HPLC. The tags are then cleaved from the separated fragments and detected by the 
respective detection technology. Examples of such technologies havo been described herein, and include for example 
mass spectrometry, infra-red spectrometry, potenticstatic amperometry or UV spectrometry. 

3. RNA Fingerprinting and Differential Display 

[0183] When the template is RNA, the first step in fingerprinting is reverse transcription. Liang and Pardee (Science 
257367, 1 992) were the first to describe an RNA fingerprinting protocol, using a primer for reverse transcription based 
on oligo (dT) but with an 'anchor' of two bases at the 5* end (e.g., oligo 5'-{dT u )CA-3\ Priming occurs mainly at. the 5' 
end of the poly(rA) tail and mainly in sequences that end 5'-UpG-poly(rA)-3\ with a selectivity approaching one out of 
12 polyadenylated RNAs. After reverse transcription and denaturation, arbitrary priming is performed on the resulting 
first strand of cDNA. PCR can now be used to generate a fingerprint of products that best matches the primers and 
that are derived from the 3' end of the mRNAs and polyadenylated heterogeneous RNAs. This protocol has been 
named 'differential display'. 

[0184] Alternatively, an arbitrary primer can be used in the first step of reverse transcription, selecting those regions 
internal to the RNA that have 6-8 base matches with the 3' end of the primer. This is followed by arbitrary priming of 
the resulting first strand of cDNA with the same or a different arbitrary primer and then PCR. This particular protocol 
samples anywhere in the RNA, including open reading frames (Welsh et al., Nua Acids. Res. 2(24965, 1992). In 
addition, it can be used on RNAs that are not polyadenylated. such as many bacterial RNAs. This variant of RNA 
fingerprinting by arbitrarily primed PCR has been called RAP-PCR. 

[0185] If arbitrarily primed PCR fingerprinting of RNA is performed on samples derived from cells, tissues or other 
biological material that have been subjected to different experimental treatments or have different developmental his- 
tories, differences in gene expression between the samples can be detected. For each reaction, it is assumed that the 
same number of effective PCR doubling events occur and any differences in the initial concentrations of cDNA products 
are preserved as a ratio of intensities in the final fingerprint. There are no meaningful relationships between the inten- 
sities of bands within a single lane on a gel, which are a function of match and abundance. However, the ratio between 
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lanes is preserved for each sampled RNA, allowing differentially expressed RNAs to be detected. The ratio of starting 
materials between samples is maintained even when the number of cycles is sufficient to allow the PCR reaction to 
saturate. This is because the number of doublings needed to reach saturation are almost completely controlled by the 
invariant products that make up the majority of the fingerprint. In this regard, PCR fingerprinting is different from con- 
ventional PCR of a single product in which the ratio of starting materials between samples is not preserved unless 
products are sampled in the exponential phase of amplification. 

[0186] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, utilizing the technique 
of RNA fingerprinting. Briefly, such methods generally comprise the steps of generating a series of tagged nucleic acid 
fragments. The fragments generated by PCR or similar amplification schemes and are then subsequently separated 
by size. The size separation step can be, for example, any of the techniques described herein, including for example 
gel electrophoresis (e.g., polyacrylamide gel electrophoresis) or preferably HPLC. The tags are then cleaved from the 
separated fragments, and then the tags are detected by the respective detection technology. Representative examples 
of suitable technologies include mass spectrometry; infra-red spectrometry, potentiostatic amperometry or UV spec- 
trometry. The relative quantities of any given nucleic acid fragments are not important, but the size of the band is 
informative when referenced to a control sample. 

4. Fluorescence-Based PCR Single-Strand Conformation Polymorphism (PCR-SSCP) 

[0187] A number of methods in addition to the RFLP approach are available for analyzing base substitution poly- 
morphisms. Ortta, et al. have devised a way of analyzing these polymorphisms on the basis of conformational differ- 
ences in denatured DNA. Briefly, restriction enzyme digestion or PCR is used to produce relatively small DNA fragments 

which are then denatured and resolved by electrophoresis on non-denaturing polyacrylamide gels. Conformational 
differences in the single-stranded DNA fragments resulting from base substitutions are detected by electrophoretic 
mobility shifts. Intra-strand base pairing creates single strand conformations that are highly sequence-specific and 
distinctive in electrophoretic mobility. However, detection rates in different studies using conventional SSCP range from 
35% to nearly 100% with the highest detection rates most often requiring several different conditions. In principle, the 
method could also be used to analyze polymorphisms based on short insertions or deletions. This method is one of 
the most powerful toois for identifying point mutations and deletions in DNA (SSCP-PCR. Dean et al.. Cell 67:863, 
1990). 

[0188] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, utilizing the technique 
of PCR-SSP. Briefly, such methods generally comprise the steps of generating a series of tagged nucleic acid frag- 
ments. The fragments generated by PCR are then separated by size. Preferably, the size separation step is non- 
denaturing and the nucleic acid fragments are denatured prior to the separation methodology. The size separation step 
can be accomplished, for example gel electrophoresis (e.g., polyacrylamide gel electrophoresis) or preferably HPLC. 
The tags are then cleaved from the separated fragments, and then the tags are detected by the respective detection 
technology (e.g., mass spectrometry, infra-red spectrometry, potentiostatic amperometry or UV spectrometry). 

5. Dideoxy Fingerprinting (ddF) 

[0189] Another method has been described (ddF, Sarkar et al., Genomics 73:441, 1992) that detected 100% of 
single-base changes in the human factor IX gene when tested in a retrospective and prospective manner. In total, 84 
of 84 different sequence changes were detected when genomic DNA was analyzed from patients with hemophilia B. 
[0190] Briefly, in the applications of tags for genotyping or other purposes, one method that can be used is dideoxy- 
fingerprinting. This method utilizes a dideoxy terminator in a Sanger sequencing reation. The principle of the method 
is as follows: a target nucleic acid that is to be sequenced is placed in a reaction which possesses a dideoxy -terminator 
complementary to the base known to be mutated in the target nucleic acid. For example, tf the mutation results in a 
A->G change, the reaction would bo carried cut in a C dideoxy-terminator reaction. PCR primers are used to locate 
and amplify the target sequence of interest. If the hypothetical target sequence contains the A->G change, the size of 
a population of sequences is changed due to the incorporation of a dideoxy-terminator in the amplified sequences, in 
this particular application of tags, a fragment would be generated which would possess a predictable size in the case 
of a mutation. The tags would be attached to the 5'-end of the PCR primers and provide a "map" to sample type and 
dideoxy-terminator type. A PCR amplification reaction would take place, the resulting fragments would be separated 
by size by for example HPLC or PAGE. At the end of the separation procedure, the DNA fragments are collected in a 
temporal reference frame, the tags are cleaved and the presence or absence of mutation is determined by the chain 
length due to premature chain terminator by the incorporation of a given dideoxy-terminator. 

[0191] It is important to note that ddf results in the gain or loss of a dideoxy-termination segment and or a shift in the 
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mobility of at least one of the termination segments or products. Therefore, in this method, a search is made of the 
shift of one fragment mobility in a high background of other molecular weight fragments. One advantage is the fore- 
knowledge of the length of fragment associated with a given mutation. 

[01921 Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
5 molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, utilizing the technique 
of ddR Briefly, such methods generally comprise the steps of generating a series of tagged nucleic acid fragments, 
followed by separation based upon size. Preferably the size separation step is non-denaturing and the nucleic acid 
fragments aro denatured prior to the separation methodology. The size separation step can be accomplished, for 
example gel electrophoresis {e.g., polyacrylamide gel electrophoresis) or preferably HPLC. The tags are then cleaved 
to from the separated fragments, and then the tags are detected by the respective detection technology (e.g., mass 
spectrometry, infra-red spectrometry, potentiostatic amperometry or UV spectrometry). 

5. Restriction Maps and RFLPs 

is [0193] Restriction endonucleases recognize short DNA sequences and cut DNA molecules at those specific sites. 
Some restriction enzymes (rare-cutters) cut DNA very infrequently, generating a small number of very large fragments 
(several thousand to a million bp). Most enzymes cut DNA more frequently thus generating a large number of small 
fragments (less than a hundred to more than a thousand bp). On average, restriction enzymes with 4-base recognition 
sites will yield pieces 256 bases long, 6-base recognition sites will yield pieces 4000 bases long, and 6-base recognition 

20 sites will yiefd pieces 64,000 bases long. Since hundreds of different restriction enzymes have been characterized, 
DNA can be cut into many different small fragments. 

[0194] A wide variety of techniques have been developed for the analysis of DNA polymorphisms. The most wideiy 
used method, the restriction fragment length polymorphism (RFPL) approach, combines restriction enzyme digestion, 
gel electrophoresis, blotting to a membrane and hybridization to a cloned DNA probe. Polymorphisms are detected as 

2S variations in the lengths of the labeled fragments on the blots. The RFLP approach can be used to analyze base 
substitutions when the sequence change falls within a restriction enzyme site or to analyze minisatellites/VNTRs by 
choosing restriction enzymes that cut outside the repeat units. The agarose gels do not usually afford the resolution 
necessary to distinguish minisatollite/VNTR alleles differing by a single repeat unit, but many of the minisateliites/ 
VNTRs are so variable that highly informative markers can still be obtained. 

30 [0195] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, utilizing the technique 
of restriction mapping or RFLPs. Briefly, such methods generally comprise the steps of generating a series of tagged 
nucleic acid fragments in which the fragments generated are digested with restriction enzymes. The tagged fragments 
are generated by conducting a hybridization step of the tagged probes with the digested target nucleic acid. The hy- 

3S bridization step can take place prior to or after the restriction nuclease digestion. The resulting digested nucleic acid 
fragments are then separated by size. The size separation step can be accomplished, for example gel electrophoresis 
(e.g., polyacrylamide gel electrophoresis) or preferably HPLC The tags are then cleaved from the separated fragments, 
and then the tags are detected by the respective detection technology (e.g., mass spectrometry, infra-red spectrometry, 
potentiostatic amperometry or UV spectrometry). 

40 

7 DNA Fingerprinting 

[0196] DNA fingerprinting involves the display of a set of DNA fragments from a specific DNA sample. A variety of 
DNA fingerprinting techniques are presently available (Jeffries et al„ Nature 314:67, 1985; Welsh and McClelland, 

-ts Nuc. Acids. Res. 75:861, 1991), most of which use PCR to generate fragments. The choice of which fingerprinting 
technique to use, is dependent on the application, e.g., DNA typing, DNA marker mapping and the organisms under 
investigation, e.g., prokaryotes, plants, animals, humans. A number of fingerprinting methods which meet these re- 
quirements have been developed over the past few years, including random amplified polymorphic DNA (RAPD). DNA 
amplification fingerprinting (DAF) and arbitrarily primed PCR (AP-PCR). These methods are all based on the amplifi- 

so cation of random genomic DNA fragments by arbitrarily selected PCR primers. DNA fragment patterns may be gener- 
ated of any DNA without prior sequence knowledge. The patterns generated depend on the sequence of the PCR 
primers and the nature of the template DNA. PCR is performed at lew annealing temperatures to allow the primers to 
anneal to multiple loci on the DNA. DNA fragments are generated when primer binding sites are within a distance that 
allows amplification. In principle, a single primer is sufficient for generating band patterns. 

$5 [0197] A new technique for DNA fingerprinting has been described, named AFLP (Vos et al. f Nuc. Acids Res. 23: 
4407, 1995). The AFLP technique is based on the detection of genomic restriction fragments by PCR amplification, 
and can be used for DN As of any origin or complexity Briefly, fingerprints are produced without prior sequence knowl- 
edge using a limited set of generic primers. The number of fragments detected in a single reaction can be "tunes" by 
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selection of specific primer sets, The AFLP technique is robust and reliable because stringent reaction conditions are 
used for primer annealing: the reliability of the RFLP technique is combined with the power of the PCR technique. 
[0198] Within one embodiment of the invention methods are provided for determining the identity of a nucleic acid 
molecule, or for detecting a selecting nucleic acid molecule, in, for example a biological sample, utilizing the technique 
5 of DNA fingerprinting. Briefly, such methods generally comprise the steps of generating a series of tagged nucleic acid 
fragments, followed by separation of the fragments by size. The size separation step can be accomplished, for example 
gel electrophoresis (e.g., polyacrylamide gel electrophoresis) or preferably HPLC. The tags are then cleaved from the 
separated fragments, and then the tags are detected by the respective detection technology (e.g., mass spectrometry, 
infra-red spectrometry, potentiostatic amperometry or UV spectrometry) 

w 

8. Application of Cleavable Tags to Genotyping and Polymorphism Detection 

a. Introduction 

>5 [0199] Although a few known human DNA polymorphisms are based upon insertions, deletions or other rearrange- 
ments of non-repeated sequences, the vast majority are based either upon single base substitutions or upon variations 
in the number of tandem repeats. Base substitutions are very abundant in the human genome, occurring on average 
once every 200-500 bp. Length variations in blocks of tandem repeats are also common in the genome, with at least 
tens of thousands of interspersed polymorphic sites (termed loci). Repeat lengths for tandem repeat polymorphisms 
20 range from 1 bp in (dA) n (dT) n sequences to at least 170 bp in a-satellite DNA. Tandem repeat polymorphisms can be 
divided into two major groups which consist of minisatellites/vanable number of tandem repeats (VNTRs), with typical 
repeat lengths of tens of base pairs and with tens to thousands of total repeat units, and microsatellites, with repeat 
lengths of up to 6 bp and with maximum total lengths of about 70 bp. Most of the microsateilite polymorphisms identified 
to dato have been based on (dC-dA) n or (dG-dT) n dinucleotide repeat sequences. Analysis of microsatollite polymer- 
y's phisms involves amplification by the polymerase chain reaction (PCR) of a small fragment of DNA containing a block 
of repeats followed by electrophoresis of the amplified DNA on denaturing polyacrylamide gel. The PCR primers are 
complementary to unique sequences that flank the blocks of repeats. Polyacrylamide gels, rather than agarose gels, 
are traditionally used for microsatellites because the alleles often only differ in size by a single repeat. 
[0200] Thus, within one aspect of the present invention methods are provided for genotyping a selected organism, 
Jo comprising the stops of (a) generating tagged nucleic acid molecules from a selected target molecule, wherein a tag 
is correlative with a particular fragment and may be detected by non-fluorescent spectrometry or potentiometn/, (b) 
separating the tagged molecules by sequential length, (c) cleaving the tag from the tagged molecule, and (d) detecting 
the tag by non-fluorescent spectrometry or potentiometry, and therefrom determining the genotype of the organism, 
[0201] Within another aspect, methods are provided for genotyping a selected organism, comprising the stops of (a) 
05 combining a tagged nucleic acid molecule with a selected target molecule under conditions and for a time sufficient to 
permit hybridization of the tagged molecule to the target molecule, wherein a tag is correlative with a particular fragment 
and may be detected by non-fluorescent spectrometry or potentiometry, (b) separating the tagged fragments by se- 
quential length, (c) cleaving the tag from the tagged fragment, and (d) detecting the tag by non-fluorescent spectrometry 
or potentiometry, and therefrom determining the genotype of the organism. 

40 

b. Application of cleavable tags to genotyping. 

[0202] A PCR approach to identify restriction fragment length polymorphism (RFPL) combines gel electrophoresis 
and detection of tags assoicated with specific PCR primers. In general, one PCR primer will possess one specific tag. 

45 The tag will therefore represent one set of PCR primers and therefore a pre-determined DNA fragment length. Poly- 
morphisms are detected as variations in the lengths of the labeled fragments in a gel or eluting from a gel. Polyacry- 
lamide gel electrophoresis will usually afford the resolution necessary to distinguish minisateiliteA/NTR alleles differing 
by a single repeat unit. Analysis of microsateilite polymorphisms involves amplification by the polymerase chain reaction 
(PCR) of a small fragment of DNA containing a block of repeats followed by electrophoresis of the amplified DNA on 

50 denaturing polyacrylamide gel or followed by separation of DNA fragments by HPLC. The amplified DNA will be labeled 
using primers that have cleavable tags at the 5' end of the primer The primers are incorporated into the newly synthe- 
sized strands by chain extension. The PCR primers are complementary to unique sequences that flank the blocks of 
repeats. Minisatellite/VNTR polymorphisms can also be amplified, much as with the microsatellites described above. 
[0203] Descriptions of many types of DNA sequence polymorphisms have provided the fundamental basis for the 

55 understanding of the structure of the human genome (Botstein et al., Am. J. Human Genetics 32:p314, 1980; Donis- 
Keller, Cell 51:319, 1987; Weissenbach et al., Nature 359.794). The construction of extensive framework linkage maps 
has been facilitated by the use of these DNA polymorphisms and has provided a practical means for localization of 
disease genes by linkage. Microsateilite dinucleotide markers are proving to be very powerful tools in the identification 
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of human genes which have bean shown to contain mutations and in some instances cause disease. Genomic dinu- 
cleotide repeats are highly polymorphic (Weber, 1990, Genomic Analysis, Vol 1, pp 159-181, Cold Spring Laboratory 
Press, Cold Spring Harbor, NY; Weber and Wong, 1993, Hum. Mol. Genetics, 2, p1123) and may possess up to 24 
alleles. Microsatellite dinucleotide ropeats can be amplified using primers complementary to the unique regions sur- 
rounding tho dinucleotide repeat by PCR. Following amplification, several amplified loci and be combined (multiplexed) 
prior to a size separation step. The process of applying the amplified microsatellite fragments to a size separation step 
and then idontifying the size and therefore the allele is known as gonotyping. Chromosome specific markers which 
permit a high level of multiplexing have been reported for performing whole genome scans for linkage analysis (Davies 
at al., 1994, Nature, 371, p1 30). 

[0204] Tags can be used to great effect in genotyping with microsatellites. Briefly, the PCR primers are constructed 
to carry tags and used in a carefully chosen PC reaction to amplify di-, tri-, or tetra- nucleotide repeats. The amplification 
products are then separated according to size by methods such as HPLC or PAGE. The DNA fragments are then 
collected in a temporal fashion, the tags cleaved from their respective DNA fragments and length deduced from com- 
parison to internal standards in the size separation step. Allele identification is made from reference to size of the 
amplified products. 

[0205] With cleavable tags approach to genotyping, it is possible to combine multiple samples on a single separation 
step. There are two general ways in which this can performed. The first general method for high through-put screening 
is the detection of a single polymorphism in a large group of individuals. In this senario a single or nested set of PCR 
primers is used and each amplification is done with one DNA sample type per reaction. The number of samples that 
can be combined in the separation step is proportional to the number of cleavable tags that can be generated per 
detection technology (i.e., 400-600 for mass spectrometer tags). It is therefore possible to identify 1 to several poly- 
morphisms in a large group of individuals simultaneously. The second approach is to use multiple sets of PCR primers 
which can identify numerous polymorphisms on a single DNA sample (genotyping an individual for example). In this 
approach PCR primers are combined in a single amplification reaction which generate PCR products of different length. 
Each primer pair or nested set is encoded by a specific cleavable Tag which implies each PCR fragment will be encoded 
witha specific tag. The reaction is run on a single separation step (see below) The number of samples that can be 
combined in the separation step is proportional to the number of cleavable tags that can be generated per detection 
technology (i.e . 400 -600 (or mass spectrometer tags) 

c. Enzymatic detection of mutation and the applications of tags. 

[0206] In this particular application or method, mismatches in heteroduplexes are detected by enzymatic cleavage 
of mismatched base pairs in a given nucleic acid duplex. DNA sequences to be tested for the presence of a mutation 
are amplified by PCR using a specific set of primers, the amplified products are denatured and mixed with denatured 
reference fragments and hybridized which result in the formation of heteroduplexes. The heteroduplexes are then 
iroated with enzymes which recognize and cleave the duplex if a mismatch is present. Such enzymes are nuclease 
S1 , Mung bean nuclease, Vesoivases', T4endonuclease IV, etc. Essentially any enzyme can be used which recognizes 
mismatches in vitro and cleave the resulting mismatch. The treatment with tho appropriate enzyme, the DNA duplexes 
are separated by size ; by, for example HPLC or PAGE. The DNA fragments are collected temporally. Tags are cleaved 
and detected. The presence of a mutation is detected by the shift in mobility of a fragments relative to a wiid-lype 
reference fragment. 

d. Applications of tags to the oligonucleotide ligation assay (OLA). 

[0207] The oligonucleotide ligation assay as originally described by Landegren ct al. (Landegen et al., Science 241: 
487, 1988) is a useful technique for the identification of sequences (known) in very large and complex genomes. The 
principle of the OLA reaction is based on the ability of ligase to covalently join two diagnostic oligonucleotides as they 
hybridize adjacent to one another on a given DNA target. If the sequences at the probe junctions are not perfectly 
based-paired, the probes will not be joined by the ligase The ability of a thermostable ligase to discriminate potential 
single base-pair differences when positioned at the 3' end of the "upstream" probe provides the opportunity for single 
base-pair resolution (Barony, PNAS USA 35:189, 1991). In the application of tags, the tags would be attached the 
probe which is ligated to the amplified product. After completion of the OLR, the fragments are separated on the basis 
of size, the tags cleaved and detected by mass spectrometry. 

e. Sequence specific ampfification. 

[0208] PC R primers with a 3' end complementary either to a mutant or normal oligonucleotide sequence can be used 

to selectively amplify one orthe other allele (Newton etal., Nucl. AcidsRes, 17:2503-15, 1 939, Nichols et al., Genomics 
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5:535-40, 1989; Okayama at al., J. Lab. Clin Med., 114:105-13, 1989; Sommer et al., Mayo Clin. Proc, 64:1361-72, 
1989; and Wu et al. Proa Natl Acad. Sci. USA, 86:2757-60, 1989). Usually the PCT products are visualized after 
amplification by PAGE, but the principle of sequence specific amplification can be applied to solid phase formats. 

f. Potential application of tags to some amplification based assays. 

[0209] Genotyping of viruses: A potential application of tags is the genotyping or identification of viruses by hybrid- 
ization with tagged probes. For example, F+ RNA coliphages may be useful candidates as indicators for enteric virus 
contamination. Genotyping by nucleic acid hybridization methods is a reliable, rapid, simple, and inexpensive alterna- 
tive to serotyping (Kafatos et. al., Nucleic Acids Res. 7 1541, 1979). Amplification techniques and nucleic aid hybridi- 
zation techniques have been successfully used to classify a variety of microorganisms including E. coli (Feng, Moi 
Cell Probes 7:151, 1993), rotavirus (Sethabutr et. al., J. Med Virol. 37:192, 1992), hepatitis C virus (Stuyver et. al., J. 
Gen Virol 74:1093, 1993), and herpes simplex virus (Matsumoto et. al., J. Virol. Methods 40. 11 9, 1992). 
[021 0] Prognostic applications of mutational analysis in cancers: Genetic alterations have been described in a variety 
of experimental mammalian and human neoplasms and represent the morphological basis for the sequence of mor- 
phological alterations observed in carcinogenesis (Vogelstein et al., NEJM 319.525, 1988). In recent years with the 
advent of molecular biology techniques, allelic losses on certain chromosomes or mutation of tumor suppressor genes 
as well as mutations in several oncogenes (e.g., c-myc, c-jun, and the ras family) have been the most studied entities. 
Previous work (Finkelstein et al., Arch Surg. 128:526, 1 993) has identified a correlation between specific types of point 
mutations in the K-ras oncogene and the stage at diagnosis in colorectal carcinoma. The results suggested that mu- 
tational analysis could provide important information of tumor aggressiveness, including the pattern and spread of 
metastasis. The prognostic value of TP53 and K-ras-2 mutational analysis in stage III carconoma of the colon has 
more recently been demonstrated (Pricolo et al., Am. J. Surg. 777:41, 1996). it is therefore apparent that genotyping 
of tumors and pre-cancerous cells, and specific mutation detection will become increasingly important in the treatment 
of cancers in humans. 

C SEPARATION OF NUCLEIC ACID FRAGMENTS 

[G211J A sample that requires analysis is often a mixture at many components in a complex matrix. For samples 
containing unknown compounds, the components must be separated from each other so that each individual compo- 
nent can be identified by other analytical methods. The separation properties oi Ihe components in a mixture are 
constant under constant conditions, and therefore once determined they can be used to identify and quantify each of 
the components. Such procedures are typical in chromatographic and electrophoretic analytical separations. 

1 . Hiqh-PQrformance Liquid Chromatography (HPLC) 

[0212] High-Performance liquid chromatography (HPLC) is a chromatographic separations technique to aop<j<, _uo 
compounds that are dissolved in solution. HPLC instruments consist of a reservoir of mobile phase, a pump, an injector, 
a separation column, and a detector. Compounds are separated by injecting an aliquot of the sample mixture onto the 
column. The different components in the mixture pass through the column at different rates due to differences in their 
partitioning behavior between the mobile liquid phase and the stationary phase. 

[0213] Recently, IP-RO-HPLC on non-porous PS/DV8 particles with chemically bonded alkyl chains have been 
shown to be rapid alternatives to capillary electrophoresis in the analysis of both single and double-strand nucleic acids 
providing similair degrees of resolution (Huber et al, Anal. Biochem. 212:351 1993; Huber et al., 1993, Nuc. Acids 
Res. 27:1061; Huber et al., Biotechniques /6:898, 1993). In contrast to ion-excahnge chromoatrography, which does 
not always retain double-strand DNA as a function of strand length (Since AT base pairs intereact with the positively 
charged stationary phase, more strongly than GC base-pairs), IP-RP-HPLC enables a strictly size-dependent sepa- 
ration. 

[0214] A method has been developed using 100 mM triethylammonium acetate as ion-pairing reagent, phosphodi- 
ester oligonucleotides could be successfully separated on alkylated non-porous 2.3 pM poly(styrene-divinylbenzene) 
particles by means of high performance liquid chromatography (Oefner et al., Anal. Biochem. 223.39, 1994). The 
technique described allowed the separation of PCR products differing only 4 to 8 base pairs in length within a size 

range of 50 to 200 nucleotides. 

2. Electrophoresis 

[0215] Electrophoresis is a separations technique that is based on the mobility of ions (or DNA as is the case de- 
scribed herein) in an electric field. Negatively charged DNA charged migrate towards a positive electrode and positively- 
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charged ions migrate toward a negative electrode. For safety reasons one electrode is usually at ground and the other 
is biased positively or negatively. Charged species have different migration rates depending on their total charge, size, 
and shape, and can therefore be separated. An electrode apparatus consists of a high-voltage power supply, electrodes, 
buffer, and a support for the buffer such as a polyacrylamide gel, or a capillary tube. Open capillary tubes are used for 
many types of samples and the other gel supports are usually used for biological samples such as protein mixtures or 
DNA fragments. 

3. Capillary Electrophoresis (CE) 

[0216] Capillary electrophoresis (CE) in its various manifestations (free solution, isotachophoresis, isoelectric focus- 
ing, polyacrylamide gel, micellar electrokinetic "chromatography") is developing as a method for rapid high resolution 
separations of very small sample volumes of complex mixtures. In combination with the inherent sensitivity and selec- 
tivity of MS, CE-MS is a potential powerful technique for bioanalysis. In the novel application disclosed herein, the 
interfacing of those two methods will lead to superior DNA sequencing methods that eclipse the current rate methods 
of sequencing by several orders of magnitude. 

[0217] The correspondence between CE and electrospray ionization (ESI) flow rates and the fact that both are fa- 
cilitated by (and primarily used for) ionic species in sclution provide the basis tor an extremely attractive combination. 
The combination of both capillary zone electrophoresis (CZE) and capillary isotachophoresis with quadrapole mass 
spectrometers based upon ESI have been described (Olivares et al., Anal. Cham. 55:1230, 1967; Smith et ai., Anal. 
Chem. 60:436, 1988; Loo et al. t Anal Chem. 179:404, 1989; Edmonds et a!., J Chroma. 474:2:, 1989; Loo et al. T J. 
Microcolumn Sep. 7:223, 1989; Lee et al., J Chromatog. 458:313, 1988; Smith et at., J. Chromatog. 4,90:211, 1989; 
Grose ot al., J Am. Chem. Soc 177:2335, 1989). Small peptides are easily amenable to CZE analysis with good (fom- 
tomole) sensitivity. 

[0218] The most powerful separation method for DNA fragments is polyacrylamide gel electrophoresis (PAGE), gen- 
erally in a slab gel format. However, the major limitation of the current technology is the relatively long time required 
to perform the gel electrophoresis of DNA fragments produced in the sequencing reactions. An increase magnitude 
(10-fold) can be achieved with the use of capillary electrophoresis which utilize ultrathin gels. In free solution to a first 
approximation ail DNA migrate with the same mobility as the addition of a base results in the compensation of mass 
and charge, in poiyacryiamide geis, DNA fragments sieve and migrate as a function of length and this approach has 
now been applied to CE. Remarkable plate number per meter has now been achieved with cross-linked polyacrylamide 
(10* 7 plates per meter, Cohen et al., Proc. Natl. Acad Set., USA 85:9660. 1988). Such CE columns as described can 
be employed for DNA sequencing. The method of CE is in principle 25 times faster than slab gel electrophoresis in a 
standard sequencer. For example, about 300 bases can be read per hour The separation speed is limited in slab gel 
electrophoresis by the magnitude of the electric field which can be applied to the gel without excessive heat production. 
Therefore, the greater speed of CE is achieved through the use of higher field strengths (300 V/crn in CE versus 10 
V/cm in slab gel electrophoresis). The capillary format reduces the amperage and thus power and the resultant heat 
generation. 

[0219] Smith and others (Smith et al., Nuc. Acids. Res. 78:4417, 1990) have suggested employing multiple capillaries 
in parallel to increase throughput. Likewise, Mathies and Huang (Mathies and Huang, Nature 359167, 1992) have 
introduced capillary electrophoresis in which separations are performed on a parallel array of capillaries and demon- 
strated high through-put sequencing (Huang et al., Anal. Chem. 64:967, 1992. Huang et aL, Anal. Chem. 64.2149, 
1992). The major disadvantage of capillary electrophoresis is the limited amount of sample that can be loaded onto 
the capillary. By concentrating a large amount of sample at the beginning of the capillary, prior to separation, loadability 
is increased, and detection levels can be lowered several orders of magnitude. The most popular method of precon- 
centration in CE is sample stacking. Sample stacking has recently been reviewed (Chien and Burgi, Anal. Chem. 64: 
489A, 1992). Sample stacking depends of the matrix difference, (pH, ionic strength) between the sample buffer and 
the capillary buffer, so that the electric field across the sample zone is more than in the capillary region. In sample 
stacking, a large volume of sample in a low concentration buffer is introduced for preconcentration at the head of the 
capillary column. The capillary is filled with a buffer of the same composition, but at higher concentration. When the 
sample ions reach the capillary buffer and the lower electric field, they stack into a concentrated zone. Sample stacking 
has increased detectabilities 1-3 orders of magnitude. 

[0220] Another method of preconcentration is to apply isotachophoresis (ITP) prior to the free zone CE separation 
of analytes. ITP is an electrophoretic technique which allows microliter volumes of sampfe to be loaded on to the 
capillary, in contrast to the low nL injection volumes typically associated with CE. The technique relies on inserting the 
sample between two buffers (leading and trailing electrolytes) of higher and lower mobility respectively, than the analyte. 
The technique is inherently a concentration technique, where the analytes concentrate into pure zones migrating with 
the same speed. The technique is currently less popular than the stacking methods described above because of the 
need for several choices of leading and trailing electrolytes, and the ability to separate only cationic or anionic species 
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during a separation process. 

[0221] The heart of the DNA sequencing process is the remarkably selective electrophoretic separation of DNA or 
oligonucleotide fragments. It is remarkable because each fragment is resolved and differs by only nucleotide. Sepa- 
rations of up to 1000 fragments (1000 bp) have been obtained. A further advantage of sequencing with cleavable tags 
is as follows. There is no requirement to use a slab gel format when DNA fragments are separated by polyacrylamide 
gel electrophoresis when cleavable tags are employed. Since numerous samples are combined (4 to 2000) there is 
no need to run samples in parallel as is the case with current dye-primer or dye-terminator methods (i.e., ABI373 
sequencer) Since there is no reason to run parallel lanes, there is no reason to use a slab gel. Therefore, one can 
employ a tube gel format for the electrophoretic separation method. Grossman (Grossman et al., Genet. Anal. Tech. 
AppL 99, 1992) have shown that considerable advantage is gained when a tube gel format is used in place of a slab 
gel format! This is due to the greater ability to dissipate Joule heat in a tube format compared to a slab gel which results 
in faster run times (by 50%), and much higher resolution of high molecular weight DNA fragments (greater than 1000 
nt). Long reads are critical in genomic sequencing. Therefore, the use of cleavable tags in sequencing has the additional 
advantage of allowing the user to employ the most efficient and sensitive DNA separation method which also possesses 
the highest resolution. 

4. Microfabricated Devices 

[0222] Capillary electrophoresis (CE) is a powerful method for DNA sequencing, forensic analysis, PCR product 
analysis and restriction fragment sizing. CE is far faster than traditional slab PAGE since with capillary gels a far higher 
potential field can be applied. However, CE has the drawback of allowing only one sample to be processed per gel. 
The method combines the faster separations times of CE with the ability to analyze multiple samples in parallel. The 
underlying concept behind the use of microfabricated devices is the ability to increase the information density in elec- 
trophoresis by miniaturizing the lane dimension to about 100 micrometers. The electronics industry routinely uses 
microfabrication to make circuits with features of less than one micron in size The current density of capillary arrays 
is limited the outside diameter of the capillary tube. Microfabrication of channels produces a higher density of arrays. 
Microfabrication also permits physical assemblies not possible with glass fibers and links the channels directly to other 
devices on a chip. Few devices have been constructed on microchips for separation technologies. A gas chromatograph 
(Terry et al., IEEE Trans. Electron Device, ED-26: 1880, 1 979) and a liquid chromatograph (Manz et al., Sons. Actuators 
87:249, 1990) have been fabricated on silicon chips, but these devices have not been widely used. Several groups 
have reported separating fluorescent dyes and amino acids on microfabricated devices (Manz et al., J. Chromatography 
593:253, 1992, Effenhauser et al., Anal. Chem. 55:2637, 1993) Recently Woolley and Mathies (Woolley and Mathies, 
Proc. Natl. Acad. Sci. 97:11343, 1994) have shown that photolithography and chemical etching can be used to make 
large numbers of separation channels on glass substrates. The channels are filled with hydroxyethyi cellulose (HEC) 
separation matrices. It was shown that DNA restriction fragments could be separated in as little as two minutes. 

D. CLEAVAGE OF TAGS 

[0223] As described above, different linker designs will confer cleavability ("lability") under different specific physical 
or chemical conditions. Examples of conditions which serve to cleave various designs of linker include acid, base, 
oxidation, reduction, fluoride, thiol exchange, photolysis, and enzymatic conditions. 

[0224J Examples of cleavable linkers that satisfy the general criteria for linkers listed above will be well known to 
those in the art and include those found in the catalog available from Pierce (Rockford. IL). Examples include: 

• ethylene glycobis(succinimidylsuccinate) (EGS), an amine reactive cross-linking reagent which is cleavable by 
hydroxylamine (1 M at 37°C for 3-6 hours); 

• disuccinimidyl tartarate (DST) and suifo-DST, which are amine reactive cross-linking reagents, cleavable by 0.015 
M sodium periodate; 

bis[2-(succinimidyloxycarbonyloxy)ethyl]sulfone (BSOCOES) and sulfo-BSOCOES, which are amine reactive 
cross-linking reagents, cleavable by base (pH 11.6); 

1,4-di-[3 , -(2'-pyridyldithio(propionamido))butane (DPDPB), a pyridyldithiol crosslinker which is cleavable by thiol 
exchange or reduction; 

N-[4-(p-azidosalicylamido)-butyl]-3 , -(2 , -pyridydithio)propionamide (APDP), a pyridyldithiol crosslinker which is 
cleavable by thiol exchange or reduction; 

• bis-[beta-4-(azidosalicylamido)ethyl]-disulfide, a photoreactive crosslinker which is cleavable by thiol exchange or 
reduction; 

• N-succinimidyl-(4-azidophenyl)-1 ,3'dithiopropionate (SADP), a photoreactive crosslinker which is cleavable by thi- 
ol exchange or reduction; 
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• sulfosuccinimidyl-2-(7-azido-4-methyfcoum (SAED), a photoreactive 
crosslinker which is cleavable by thiol exchange or reduction; 

• sulfosuccinimidyl^^m-aztdo-o-nitrobenzamidoj-ethyl-l^'dithiopropionate (SAND), a photoreactive crosslinker 
which is cleavable by thiol exchange or reduction. 

5 

[0225] Other examples of cleavable linkers and the cleavage conditions that can be used to release tags are as 
follows. A silyl linking group can be cleaved by fluoride or under acidic conditions. A 3-, 4-, 5-, or 6-substituted-2-ni- 
trobenzyloxy or 2-, 3-, 5-, or 6-substituted-4-nitrobenzyloxy linking group can be cleaved by a photon source (photol- 
ysis). A 3- ( 4-, 5-, or 6-substituted-2-alkoxyphenoxy or 2-. 3-, 5-, or 6-substituted-4-alkoxyphenoxy linking group can 

10 be cleaved by Ce(NH 4 )2(N0 3 ) 6 (oxidation). A NCO a (urethane) linker can be cleaved by hydroxide (base), acid f or 
LiAIH 4 (reduction). A 3-pentenyl, 2-butenyl, or 1-butenyl linking group can be cleaved by 0 3 , 0 3 0 4 /l0 4 % or KMn0 4 
(oxidation). A 2-{3-, 4-, or 5-substituted-furyl]oxy linking group can be cleaved by O a , Br 2 , MeOH, or acid. 
[0226] Conditions for the cleavage of other labiie linking groups include: t-alkyloxy linking groups can be cleaved by 
acid: methyl(dialkyl)methoxy or 4-substituted-2-alkyl-1,3-dioxiane-2-yl linking groups can be cleaved by H 3 0 + ; 2-si- 
lylethoxy linking groups can bo cleaved by fluoride or acid; 2-(X)-ethoxy (where X = keto, ester amide, cyano, N0 2 , 
sulfide, sulfoxide, sulfone) linking groups can be cleaved under alkaline conditions; 2-, 3-, 4-, 5-, or 6-substituted- 
benzyloxy linking groups can be cleaved by acid or under reductive conditions; 2-butenyloxy linking groups can be 
cleaved by (Ph 3 P) 3 RhCI(H), 3-, 4-, 5-, or 6-substituted-2-bromophenoxy linking groups can be cleaved by Li, Mg, or 
8uLi; methyithiomethoxy linking groups can be cleaved by Hg 2f ; 2-(X)-ethyfoxy (where X = a halogen) linking groups 

20 can be cleaved by Zn or Mg; 2-hydroxyethyloxy linking groups can be cleaved by oxidation (e.g., with Pb(OAc) 4 ). 

[0227] Preferred linkers are those that are cleaved by acid or photolysis. Several of the actd-labile linkers that have 
been developed for solid phase peptide synthesis are useful for linking tags (o MOIs. Some of these linkers are de- 
scribed in a recent review by Lloyd-Williams et al. (Tetrahedron 49:11065-11133, 1993). One useful type ot linker is 
based upon p-alkoxybenzyl alcohols, of which two, 4-hydroxymethylphenoxyacetic acid and 4-(4-hydroxymethyl- 

- 5 3-methoxyphenoxy)butyric acid, are commercially available from Advanced ChemTech (Louisville, KY). Both linkers 
can be attached to a tag via an ester linkage to the benzylaicohol, and to an amine-containing MOI via an amide linkage 
to the carboxylic acid. Tags linked by these molecules are released from the MOI with varying concentrations of trif- 
luoroacetic acid The cleavage of these linkers results in the liberation of a carboxylic acid on the tag. Acid cleavage 
of tags attached through related linkers, such as 2,4-dimethoxy-4 , -fcarboxymethyloxy)-benzhydrylamine (available 

JO from Advanced ChemTech in FMOC -protected form), results in liberation of a carboxylic amide on the released tag. 
[0228] The photolabile linkers useful for this application have also been for the most part developed for solid phase 
peptide synthesis (see Lloyd-Williams review). These linkers are usually based on 2-nitrobenzylesters or 2-nitroben- 
zylamides. Two examples of photolabile linkers that have recently been reported in the literature are 4-(4-(1-Fmoc- 
amino)ethyi)-2-methoxy-5-nitrophenoxy)butanoic acid (Holmes and Jones, J. Org. Chem, 602318-2319, 1995) and 

35 3-(Fmoc*amino)-3-(2-nitrophenyi)propionic acid (Brown et al., Molecular Diversity 7:4-12, 1995). Both linkers can be 
attached via the carboxylic acid to an amine on the MOI, The attachment of the tag to the linker is made by forming 
an amide between a carboxylic acid on the tag and the amine on the linker. Cleavage of photolabile linkers is usually 
performed with UV tight of 350 nm wavelength at intensities and times known to those in the art. Examples of commercial 
sources of instruments for photochemical cleavage are Aura Industries Inc. (Staten Island, NY) and Agrenetics (Wilm- 

4 o ington, MA). Cleavage of the linkers results in liberation of a primary amide on the tag. Examples of photocleavable 
linkers include nitrophenyl glycine esters, exo- and endo-2-benzonorborneyl chlorides and methane sulfonates, and 
3-amino-3(2-nitrophenyl) propionic acid. Examples of enzymatic cleavage include esterases which will cleave ester 
bonds, nucleases which will cleave phosphodiester bonds, proteases which cleave peptide bonds, etc. 

*s E. DETECTION OF TAGS 

[0229] Detection methods typically rely on the absorption and emission in some type of spectral field. When atoms 
or molecules absorb fight, the incoming energy excites a quantized structure to a higher energy level. The type of 
excitation depends on the wavelength of the light. Electrons are promoted to higher orbitals by ultraviolet or visible 

50 light, molecu lar vibrations are excited by infrared light, and rotations are excited by microwaves. An absorption spectrum 
is the absorption of light as a function of wavelength. The spectrum of an atom or molecule depends on its energy level 
structure. Absorption spectra are useful for identification of compounds. Specific absorption spectroscopic methods 
include atomic absorption spectroscopy (AA), infrared spectroscopy (IR), and UV-vis spectroscopy (uv-vis). 
[0230] Atoms or molecules that are excited to high energy levels can decay to lower levels by emitting radiation. 

55 This light emission is called fluorescence if the transition is between states of the same spin, and phosphorescence if 
the transition occurs between states of different spin. The emission intensity of an analyte is linearly proportional to 
concentration (at lowconcentrations), and is useful for quantifying the emitting species. Specific emission spectroscopic 
methods include atomic emission spectroscopy (AES).. atomic fluorescence spectroscopy (AFS), molecular iaser-in- 
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duced fluorescence (LIF), and X-ray fluorescence (XRF). 

[0231] When electromagnetic radiation passes through matter, most of the radiation continues in its original direction 
but a small fraction is scattered in other directions. Light that is scattered at the same wavelength as the incoming light 
is called Rayleigh scattering. Light that is scattered in transparent solids due to vibrations (phonons) is called Brillouin 
5 scattering. Brillouin scattering is typically shifted by 0. 1 to 1 wave number from the incident light. Light that is scattered 
due to vibrations in molecules or optical phonons in opaque solids is called Raman scattering. Raman scattered light 
is shifted by as much as 4000 wavenumbers from the incident light. Specific scattering spectroscopic methods include 
Raman spectroscopy. 

[0232] IR spectroscopy is the measurement of the wavelength and intensity of the absorption of mid-infrared light 

10 by a sample. Mid-infrared light (2.5 - 50 (im, 4000 - 200 cm' 1 ) is energetic enough to excite molecular vibrations to 
higher energy levels. The wavelength of IR absorption bands are characteristic of specific types of chemical bonds 
and IR spectroscopy is generally most useful for identification of organic and organometaliic molecules. 
[0233] Near-infrared absorption spectroscopy (NIR) is the measurement of the wavelength and intensity of the ab- 
sorption of near-infrared light by a sample. Near-infrared light spans the 800 nm - 2.5 u.m (12,500 - 4000 cm" 1 ) range 

<5 and is energetic enough to excite overtones and combinations of molecular vibrations to higher energy levels. NIR 
spectroscopy is typically used for quantitative measurement of organic functional groups, especially O-H, N-H, and 
C-O. The components and design of NIR instrumentation are similar to uv-vis absorption spectrometers. The light 
source is usually a tungsten lamp and the detector is usuajly a PbS solid-state detector. Sample holders can be glass 
or quartz and typical solvents are CCI 4 and CS 2 . The convenient instrumentation of NIR spectroscopy makes it suitable 

-0 for on-line monitoring and process control. 

[0234] Ultraviolet and Visible Absorption Spectroscopy (uv-vis) spectroscopy is the measurement of the wavelength 
and intensity of absorption of near-ultravioiet ~->d visible light by a sample. Absorption in the vacuum UV occurs at 
100-200 nm; (10 5 -50,000 cm' 1 ) quartz UV at 200-350 nm; (50,000-28,570 cm" 1 ) and visible at 350-800 nm; 
(28,570-12,500 cm" 1 ) and ss described by the Boer-Lambert-Bouguet law. Ultraviolet and visible light are energetic 

25 onough to promote outer electrons to higher energy levels. UV-\/is spectroscopy can be usually applied to molecules 
and inorganic ions or complexes in solution. The uv-vis spectra are limited by the broad features of the spectra. The 
light source is usually a hydrogen or deuterium lamp for uv measurements and a tungsten lamp for visible measure- 
ments. The wavelengths of these continuous light sources are selected with a wavelength separator such as a prism 
or grating monochromator. Spectra are obtained by scanning the wavelength separator and quantitative measurements 

>>o can be made from a spectrum or at a single wavelength 

[0235] Mass spectrometers use the difference in the mass-to-charge ratio (m/z) of ionized atoms or molecules to 
separate them from each other. Mass spectrometry is therefore useful for quantitation of atoms or molecules and also 
for determining chemical and structural information about molecules. Molecules have distinctive fragmentation patterns 
that provide structural information to identify compounds. The general operations of a mass spectrometer are as follows. 

■35 Gas-phase ions are created, the ions are separated in space or time based on their mass-to-chargo ratio, and the 
quantity of ions of each mass-to-charge ratio is measured. The ion separation power of a mass spectrometer is de- 
scribed by the resolution, which is defined as R = m / delta m, where m is the ion mass and delta m is the difference 
in mass between two resolvable peaks in a mass spectrum. For example, a mass spectrometer with a resolution of 
1000 can resolve an ion with a m/z of 100.0 from an ion with a m/z of 100.1. 

[0236] in general, a mass spectrometer (MS) consists of an ion source, a mass-selective analyzer, and an ion de- 
tector. The magnetic-sector, quadrupole. and time-of-flight designs also require extraction and acceleration ion optics 
to transfer ions from the source region into the mass analyzer. The details of several mass analyzer designs (for 
magnetic-sector MS, quadrupole MS or time-of-flight MS) are discussed below. Single Focusing analyzers for magnetic- 
sector MS utilize a particle beam path of 180, 90, or 60 degrees. The various forces influencing the particle separate 
ions with different mass-to-charge ratios. With double-focusing analyzers, an electrostatic analyzer is added in this 
type of instrument to separate particles with difference in kinetic energies. 

[0237] A quadrupole mass filter for quadrupole MS consists of four metal rods arranged in parallel. The applied 
voltages affect the trajectory of ions traveling down the flight path centered between the four rods For given DC and 
AC voltages, only ions of a certain mass-tc-charge ratio pass through the quadrupole filter and all other ions are thrown 
so out of their original path. A mass spectrum is obtained by monitoring the ions passing through the quadrupole filter as 
the voltages on the rods are varied. 

[0238] A time-of-flight mass spectrometer uses the differences in transit time through a "drift region" to separate ions 
of different masses. It operates in a pulsed mode so ions must be produced in pulses and/or extracted in pulses. A 
pulsed electric field accelerates all ions into a field-free drift region with a kinetic energy of qV, where q is the ion charge 
55 and V is the applied voltage. Since the ion kinetic energy is 0.5 mV 2 , lighter ions have a higher velocity than heavier 
ions and reach the detector at the end of the drift region sooner. The output of an ion detector is displayed on an 
oscilloscope as a function of time to produce the mass spectrum. 

[0239] The ion formation process is the starting point for mass spectrometry analyses. Chemical ionization is a 
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method that employs a reagent ion to react with the analyto molecules (tags) to form ions by either a proton or hydride 
transfer, The reagent ions are produced by introducing a large excess of methane (relative to the tag) into an electron 

impact ^Ei) ion source. Election collisions produce CH 4 ! an J Gri 3 * which further react with methane io form CH^ and 
C 2 H 5 f Another method to ionize tags is by plasma and glow discharge. Plasma is a hot, partially-ionized gas that 
effectively excites and ionizes atoms. A glow discharge is a low-pressure plasma maintained between two electrodes. 
Electron impact ionization employs an electron beam, usually generated from a tungsten filament, to ionize gas-phase 
atoms or molecules. An electron from the beam knocks an electron off analyte atoms or molecules to create ions. 
Electrospray ionization utilizes a very fine needle and a series of skimmers. A sample solution is sprayed into the 
source chamber to form droplets. The droplets carry charge when the exit the capillary and as the solvent vaporizes 
the droplets disappear leaving highly charged analyte molecules. ESI is particularly useful for large biological molecules 
that are difficult to vaporize or ionize. Fast-atom bombardment (FAB) utilizes a high-energy beam of neutral atoms, 
typically Xe or Ar, that strikes a solid sample causing desorption and ionization. It is used for large biological molecules 
that are difficult to get into the gas phase. FAB causes little fragmentation and usually gives a large molecular ion peak, 
making it useful for molecular weight determination. The atomic beam is produced by accelerating ions from an ion 
source though a charge-exchange cell. The ions pick up an electron in collisions with neutral atoms to form a beam of 
high energy atoms. Laser ionization (LIMS) is a method in which a laser pulse ablates material from the surface of a 
samole and creates a microplasma that ionizes some of the sample constituents. Matrix-assisted laser desorption 
ionization ^viAi^n i* a u .ViS moincu or vaporizing ana ionizing lafjo oioioyicai molecules sucn do pictoins or ON A 
fragments. The biological molecules are dispersed in a solid matrix such as nicotinic acid. A UV. laser pulse ablates 
the matrix which carries some of the large molecules into the gas phase in an ionizud 'irm so they can be extracted 
into a mass spectrometer. Plasma-desorption ionization (PD) utilizes the decay of 252 Cf which produces two fission 
fragments that travel in opposite directions. One fragment strikes the sample knocking out 1 -10 analyte ions. The other 
fragment strikes a detector and triggers the start of data acquisition This ionization method is especially useful for 
large biological molecules. Resonance ionization (RIMS) is a method in which one or more laser beams are tuned in 
resonance to transitions of a gas-phase atom or molecule to promote it in a stepwise fashion above its ionization 
potential to create an ion. Secondary ionization (SIMS) utilizes an ion beam; such as 3 He^, 16 O f , or 40 Ar + ; is focused 
onto the surface of a sample and sputters material into the gas phase. Spark source is a method which ionizes analytes 
in solid samples by pulsing an electric current across two electrodes. 

[0240] A tag may become charged prior to, during or after cleavage from the molecule to which it is attached. Ioni- 
zation methods based on ion "desorption", the direct formation or emission of ions from solid or liquid surfaces have 
allowod increasing application to nonvolatile and thermally labile compounds. These methods eliminate the need for 
neutral molecule volatilization prior to ionization and generally minimize thermal degradation of the molecular species. 
These methods include field desorption (Becky, Principles of Field Ionization and Field Desorption Mass Spectrometry, 
Pergamon, Oxford. 1977), plasma desorption (Sundqvist and Macfarlane, Mass Spectrom. Rev. 4:42), 1985), laser 
desorption (Karas and Hillenkamp, Anal. Chem. 60:2299, 1988; Karas etal., Angow. Chem. 707:305, 1989), fast particle 
bombardment (e.g., fast atom bombardment, FAB, and secondary ion mass spectrometry, SIMS, Barber et al., Anal. 
Chem. 54.645A, 1982), and thermospray (TS) ionization (Vestal, Mass Spectrom. Rev. 2:447, 1983). Thermospray is 
broadly applied for the on-line combination with liquid chromatography. The continuous flow FAB methods (Caprioli et 
al., Anal. Chem, 58.2949, 1986) have also shown significant potential. A more complete listing of ionization/mass 
spectrometry combinations is ion-trap mass spectrometry, electrospray ionization mass spectrometry, ion-spray mass 

zation mass spectrometry, metastabie atom bombardment ionization mass spectrometry, fast atom bomoard ionization 
mass spectrometry, MALDI mass spectrometry, , photo-ionizaticn time-of-flight mass spectrometry, laser droplet mass 
spectrometry, MALDI -TOF mass spectrometry, APCI mass spectrometry, nano-spray mass spectrometry, nebulised 
spray ionization mass spectrometry, chemical ionization mass spectrometry, resonance ionization mass spectrometry, 
secondary ionization mass spectrometry, thermospray mass spectrometry. 

[0241] The ionization methods amenable to nonvolatile biological compounds have overlapping ranges of applica- 
bility. Ionization efficiencies are highly dependent on matrix composition and compound type. Currently available results 
indicate that the upper molecular mass for TS is about 8000 daltons (Jones and Krolik, Rapid Comm. Mass Spectrom. 
7:67, 1987). Since TS is practiced mainly with quadrapole mass spectrometers, sensitivity typically suffers disporpor- 
tionately at higher mass-to-charge ratios (m/z). Time-of-rlight (TOF) mass spectrometers are commercially available 
and possess the advantage that the m/z range is limited only by detector efficiency. Recently, two additional ionization 
methods have been introduced. These two methods are new referred to as matrix-assisted laser desorption (MALDI, 
Karas and Hillenkamp, Anal. Chem. 602299, 1988; Karas et al.. Angew. Chem. 707:805, 1989) and electrospray 
ionization (ESI). Both methodologies have very high ionization efficiency (i.e., very high [molecular ions produced]/ 
[molecules consumed]). Sensitivity, which defines the ultimate potential of the technique, is dependent on sample size, 
quantity of ions, flow rate, detection efficiency and actual ionization efficiency. 

[0242] Electrospray-MS is based on an idea first proposed in the 1 960s (Doie et al., J. Chem. Phys. 49 2240. 1968). 
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Electrospray ionization (ESI) is one means to produce charged molecules lor analysis by mass spectroscopy. Briefly, 
electrospray ionization produces highly charged droplets by nebulizing liquids in a strong electrostatic field. The highly 
charged droplets, generally formed in a dry bath gas at atmospheric pressure, shrink by evaporation of neutral solvent 
until the charge repulsion overcomes the cohesive forces, leading to a "Coulombic explosion" The exact mechanism 

5 of ionization is controversial and several groups have put forth hypotheses (Blades et al., Anal. Chem. 63:2109-14, 
1991; Kebarleet al., Anal. Chem. 65.A972-86, 1993; Fenn, J. Am. Soc. Mass. Spectrom. 4:524-35, 1993). Regardless 
of the ultimate process of ion formation, ESI produces charged molecules from solution under mild conditions. 
[0243] The ability to obtain useful mass spectral data on small amounts of an organic molecule relies on the efficient 
production of ions. The efficiency of ionization for ESI is related to the extent of positive charge associated with the 

*o molecule. Improving ionization experimentally has usually involved using acidic conditions. Another method to improve 
ionization has been to use quaternary amines when possible (see Aebersold et al., Protein Science /:494-503, 1992; 
Smith et al., Anal. Chem. 60436-41, 1988). 

[0244] Electrospray ionization is described in more detail as follows. Electrospray ion production requires two steps; 
dispersal of highly charged droplets at near atmospheric pressure, followed by conditions to induce evaporation. A 
is solution of analyte molecules is passed through a needle that is kept at high electric potential. At the end of the needle, 
the solution disperses into a mist of small highly charged droplets containing the analyte molecules. The small droplets 
evaporate quickly and by a process of field desorption or residual evaporation, protcnated protein molecules are re- 

of liquid (generally 1-10 uL/min) from a capillary tube. A potential difference of 3-6 kV is typically applied between the 
20 capillary and counter electrode located 0.2-2 cm away (where ions, charged clusters, ano even charged droplets, 
depending on the extent of dosolvation, may be sampled by the MS through a small orifice). The electric field results 
in charge accumulation on the liquid surface at the capillary terminus: thus the liquid flow rate r resistivity, and surface 
tension are important factors in droplet production. The high electric field results \n disruption of the liquid surface and 
formation of highly charged liquid droplets. Positively or negatively charged droplets can be produced depending upon 
? 5 the capillary bias. The negative ion mode requires the presence of an electron scavenger such as oxygen to inhibit 
electrical discharge. 

[0245] A wide range a! liquids can be sprayed electrostatically into a vacuum, or with the aid of a nebulizing agent. 
The use of only electric fields for nobulization leads to some practical restrictions on the range of liquid conductivity 
and dielectric constant. Solution conductivity of less than 10~ 5 ohms is required at room temperature for a stable elec- 
w trospray at useful liquid flow rates corresponding to an aqueous electrolyte solution of < 10~ 4 M. In the mode found 
•res; useful for ESI -MS. in ape rcpii.Ho liquid flew .-ate a- -suits u\ Jicpai ;>ion of the liquid is a ;"ino mist. A short ais lance 
from the capillary the droplet diameter is often quite uniform and on the order of 1 urn. Of particular importance is that 
the total electrospray ion current increases only slightly for higher liquid flow rates. There is evidence that heating is 
useful for manipulating the electrospray. For example, slight heating allows aqueous solutions to be readily electro- 
ns sprayed, presumably due to the decreased viscosity and surface tension. Both thermally-assisted and gas-nebulization- 
assisted electrosprays allow higher liquid flow rates to be used, but decrease the extent of droplet charging. The for- 
mation of molecular iun^ requires conditions affecting evaporation of the initial droplet population. This can be accom- 
plished at higher pressures by a flow of dry gas at moderate temperatures (<60°C), by heating during transport through 
the interface, and (particularly in the case of ion trapping methods) by energetic collisions at relatively low pressure. 
4 o [0246] Although the detailea processes underlying ESI remain uncertain, the very small oropiets produced by ESI 
appear to allow almost any species carrying a net charge in solution to be transferred to the gas phase after evaporation 
oi resiouai solvent. Mass spectrcmetnc detection then requires that ions have a tractable m/z range (<4000 daltons 
for quadrupole instruments) after desolvatioa as well as to be produced and transmitted with sufficient efficiency. The 
wide range of solutes already found to be amenable to ESI-MS, and the lack of substantial dependence of ionization 
4 $ efficiency upon molecular weight, suggest a highly non-discriminating and broadly applicable ionization process. 

[0247] The electrospray ion "source" functions at near atmospheric pressure. The electrospray "source" is typically 
a metal or glass capillary incorporating a method for electrically biasing the liquid solution relative to a counter electrode. 
Solutions, typically water-methanoi mixtures containing the analyte and often other additives such as acetic acid, flow 
to the capillary terminus. An ESI source has been described (Smith et al., Anal. Chem. 62885, 1990) which can ac- 
50 commodate essentially any solvent system. Typical flow rates for ESI are 1-10 uL/min. The principal requirement of 
an ESi-MS interface is to sample and transport ions from the high pressure region into the MS as efficiently as possible. 
[0248] The efficiency of ESI can be very high, providing the basis for extremely sensitive measurements, which is 
useful for the invention described herein. Current instrumental performance can provide a total ion current at the de- 
tector of about 2 x 10" 12 A or about 10 7 counts/s for singly charged species. On the basis of the instrumental perforrn- 
$s ance, concentrations of as low as 10* 10 M or about 10* 18 mol/s of a singly charged species will give detectable ion 
current (about 10 counts/s) if the analyte is completely ionized. For example, lowattomole detection limits have been 
obtained for quaternary ammonium ions using an ESI interface with capillary zone electrophoresis (Smith et al., Anal. 
Chem 59 1 230. 1988) For a compound of molecular weight of 1 000 tho average number of charges is 1 , the approx- 
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imato number of charge states is 1. peak width (m/z) is 1 and the maximum intensity (ion/s) is 1 x 10 t2 
[0249] Remarkably little sample is actually consumed in obtaining an ESI mass spectrum (Smith et a!., Anal. Chem. 
60. 1 948, 1 988). Substantial gains might be also obtained by the use of array detectors with sector instruments, allowing 
simultaneous detection of portions of the spectrum. Since currently only about 10" s of all ions formed by ESI are de- 
tected, attention to the factors limiting instrument performance may provide a basis for improved sensitivity. It will be 
evident to those in the art that the present invention contemplates and accommodates for improvements in ionization 
and detection methodologies. 

[0250] An interface is preferably placed between the separation instrumentation (e.g., goljand the detector (e.g., 
mass spectrometer). The interface preferably has the following properties; (1) the ability to collect the DNA fragments 
at discreet time intervals, (2) concentrate the DNA fragments, (3) remove the DNA fragments from the electrophoresis 
buffers and milieu, (4) cleave the tag from the DNA fragment, (5) separate the tag from the DNA fragment, (6) dispose 
of the DNA fragment, (7) place the tag in a volatile solution, (S) volatilize and ionize the tag, and (9) place or transport 
the tag to an electrospray device that introduces the tag into mass spectrometer. 

[0251] The interface also has the capability of "collecting* DNA fragments as they elute from the bottom of a gel. 
The gel may be composed of a slab gel, a tubular gel, a capillary, etc. The DNA fragments can be collected by several 
methods. The first method is that of use of an electric field wherein DNA fragments are collected onto or near an 
electrode. A second method is that wherein the DNA fragments are collected by flowing a stream of liquid past the 
bottom of a gel. Aspects of both methods can be combined wherein DNA collected into a flowing stream which can be 
later concentrated by use of an electric field. The end result is that DNA fragments are removed from the milieu under 
which the separation methoo was performed. That is, DNA fragments can be "dragged" from one solution type to 
another by use of an electric field. 

[0252] Once the DNA fragments are in the appropriate solution (compatible with electrospray and mass spectrometry) 
the tag can bo cleaved from the DNA fragment. The DNA fragment (or remnants thereof) can then be separated from 
the tag by the application of an electric field (preferably, the tag is of opposite charge of that of the DNA tag). The tag 
is then introduced into the electrospray device by the use of an electric field or a flowing liquid. 

[0253] Fluorescent tags can be identified and quantitated most directly by their absorption and fluorescence emission 
wavelengths and intensities. 

[0254] While a conventional spoctrcfluoromctor is extremely flexible, providing continuous ranges of excitation and 
emission wavelengths M^, 1 31 , 1 32 ), more specialized instruments such as flow cytcmeters and laser-scanning mi- 
croscopes require probes that are excitable at a single fixed wavelength. In contemporary instruments, this is usually 
the 488-nm line of the argon laser. 

[0255] Fluorescence intensity per probe molecule is proportional to the product of e and QY. The range of those 
parameters among fluorophores of current practical importance is approximately 10,000 to 100,000 cm^M* 1 fort: and 
0. 1 to 1 .0 for QY When absorption is driven toward saturation by high-intensity illumination, the irreversible destruction 
of the excited fluorophore (photobleaching) becomes the factor limiting fluorescence detectability The practical impact 
of photobleaching depends on the fluorescent detection technique in question. 

[0256] It will be evident to one in the art that a device (an interface) may be interposed between the separation and 
detection steps to permit the continuous operation of size separation and tag detection (in real time). This unites the 
separation methodology and instrumentation with the detection methodology and instrumentation forming a single 
device. For example, an interface is interposed between a separation technique and detection by mass spectrometry 
or potentiostatic amperometry. 

[0257] The function of the interface is primarily the release of the (e.g., mass spectrometry) tag from analyte. There 
are several representative implementations of the interface. The design of the interface is dependent on the choice of 
cleavable (inkers. In the case of light or photo-cieavable linkers, an energy or photon source is required. In the case 
of an acid-labile linker, a base-labile linker, or a disulfide linker, reagent addition is required within the interface. In the 
case of heat-labile linkers, an energy heat source is required. Enzyme addition is required for an enzyme-sensitive 
linker such as a specific protease and a peptide linker, a nuclease and a DNA or RNA linker, a glycosyiase, HRP or 
phosphatase and a linker which is unstable after cleavage (e.g., similiar to chemiluminescent substrates). Other char- 
acteristics of the interface include minimal band broadening, separation of DNA from tags before injection into a mass 
spectrometer. Separation techniques include those based on electrophoretic methods and techniques, affinity tech- 
niques, size retention (dialysis), filtration and the like. 

[0258] It is also possible to concentrate the tags (or nucleic acid-linker-tag construct), capture electrophoretically, 
jnd then release into aits mate reagent stream which ;s oampuus'o with the particular type of ionization method selected. 
The interface may also be capable of capturing the tags (or nucleic acid-linker-tag construct) on microbeads, shooting 
the bead(s) into chamber and then preforming laser desorption/vaporization. Also it is possible to extract in flow into 
alternate buffer (e.g., from capillary electrophoresis buffer into hydrophobic buffer across a permeable membrane). It 
may also be desirable in some uses to deliver tags into the mass spectrometer intermittently which would comprise a 
further function of the interface. Another function of the interface is to deliver tags from muftiple columns into a mass 
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spectrometer, with a rotating time slot for each column. Also, it is possible to deliver tags from a single column into 
multiple MS detectors, separated by time, collect each set of tags for a few milliseconds, and then deliver to a mass 
spectrometer. 

[0259] The following is a list of representative vendors for separation and detection technologies which may be used 
in the present invention. Hoefer Scientific Instruments (San Francisco, CA) manufactures electrophoresis equipment 
^Two Step ,M , Poker Face rM II) tor sequencing applications. Pharmacia Biotech (Piscataway. NJ) manufactures elec- 
trophoresis equipment for DNA separations and sequencing (PhastSystem for PCR-SSCP analysis, MacroPhor Sys- 
tem for DNA sequencing). Perkin Elmer/Applied Biosystems Division (ABI, Foster City, CA) manufactures semi-auto- 
mated sequencers based on fluorescent-dyes (ABI373 and ABI377). Analytical Spectral Devices (Boulder, CO) man- 
ufactures UV spectrometers. Hitachi Instruments (Tokyo, Japan) manufactures Atomic Absorption spectrometers, Flu- 
orescence spectrometers, LC and GC Mass Spectrometers. NMR spectrometers, and UV-VIS Spectrometers. PerSep- 
tive Biosystems (Framingham, MA) produces Mass Spectrometers (Voyager™ Elite). Bruker Instruments Inc. (Manning 
Park, MA) manufactures FTIR Spectrometers (Vector 22), FT-Raman Spectrometers, Time of Flight Mass Spectrom- 
eters (Reflex II™), Ion Trap Mass Spectrometer (Esquire rM ) and a Matdi Mass Spectrometer Analytical Technology 
Inc. (ATI, Boston, MA) makes Capillary Gel Electrophoresis units, UV detectors, and Diode Array Detectors. Teledyne 
Electronic Technologies (Mountain View, CA) manufactures an Ion Trap Mass Spectrometer (3DQ Discovery™ and 
the 3DQ Apogee™). Perkin Elmer/Applied Biosystems Division (Foster City, CA) manufactures a Sctex Mass Spec- 
Clara, CA) produces Mass Selective Detectors (HP 5972A), MALDI-TOF Mass Spectrometers (HP G2025A), Diode 
Array Detectors, CE units, HPLC units (HP1090) as well as UV Spectrometers. Finnigan Corporation (San Jose, CA) 
manufactures mass spectrometers (magnetic sector (MAT 95 5 rM ), quadrapoio spectrometers (MAT 95 SQ rM ) and 
four other related mass spectrometers). Rainin (Emeryville, CA) manufactures HPLC instruments. 
[0260] The methods and compositions described herein permit the use of cleaved tags to serve as maps to particular 
sample typo and nucleotide identity. At the beginning of each sequencing method, a particular (selected) primer is 
assigned a particular unique tag. The tags map to either a sample type, a dideoxy terminator type (in the case of a 
Sanger sequencing reaction) or preferably both. Specifically, the tag maps to a primer type which in turn maps to a 
vector typo which in turn maps to a sample identity. Tho tag may also may map to a dideoxy terminator type (ddTTR 
ddCTP, ddGTP, ddATP) by reference into which dideoxynuclectide reaction the tagged primer is placed. The sequenc- 
ing reaction \s then performed and the resulting fragments are sequentially separated by size in time 
[0261] The tags are cleaved from the fragments in a temporal frame and measured and recorded in a temporal frame. 
The sequence is constructed by comparing the tag map to the temporal frame. That is, all tag identities are recorded 
in time after the sizing step and related become related to one another in a temporal frame. The sizing step separates 
the nucleic acid fragments by a one nucleotide increment and nonce the related tag identities are separated by a one 
nucleotide increment. By foreknowledge of the dideoxy-termmator or nucleotide map and sample type, the sequence 
is readily deduced in a linear fashion. 

[0262] The following examples are offered by way of illustration, and not by way of limitation. 

[0263] Unless otherwise stated, chemicals as used in the examples may be obtained from Aldrich Chemical Com* 

pany, Milwaukee, Wl. The following abbreviations, with the indicated meanings, are used herein: 

ANP = 3-(Fmoc-amino)-3-(2-nitrcphenyl)propionic acid 
NBA - 4-(Fmoc-aminomethy1)-3-nitrobenzoic acid 

HATU = 0-7-azabenzotriazol-1-yl-N.N,N',N'-tetramethyluronium hexafluorophosphate 
DIEA = diisopropylethylamine 
MCT - monochlorotriazine 
NMM = 4-methylmorpholine 
NMP = N-methylpyrrolidone 

ACT357 = ACT357 peptide synthesizer from Advanced ChemTech, Inc., Louisville. KY 
ACT= Advanced ChemTech ; Inc., Louisville, KY 

NovaBiochem - CaJBicchem-NovaBiochem international, San Diego, CA 

TFA = Triflucroacetic acid 

Tfa = Trifluoroacetyl 

iNIP = N-Methylisonipecotic acid 

Tfp = Tetrafluorophenyl 

DIAEA = 2-(Diisopropylamino)ethylamine 

MCT = monochlorotriazene 

S'-AH-ODN = S'-aminohexyi-tailed oligodeoxynucleotide 
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EXAMPLES 
EXAMPLE 1 

5 PREPARATION OF ACID LABILE LINKERS FOR USE IN CLEAVABLE-MW-IDENTIFIER SEQUENCING 

A. Synthesis of Pentafluorophenyl, Esters of Chemically Cleavable Mass Spectroscopy Tags, to Liberate Tags with 
Carboxyl Amide Termini 

10 [0264] Figure 1 shows the reaction scheme. 

Step A. TentaGel S AC resin (compound II; available from ACT; 1 eq.) is suspended with DMF in the collection vessel 
of the ACT357 peptide synthesizer (ACT). Compound 1 (3 eq.), HATU (3 eq.) and DIEA (7.5 eq.) in DMF are added 
and the collection vessel shaken for 1 hr. The solvent is removed and the resin washed with NMP (2X), MeOH (2X), 
and DMF (2X) The coupling of I to the resin and the wash steps are repeated, to give compound III. 

75 Step B. The resin (compound III) is mixed with 25% piperidine in DMF and shaken for 5 min. The resin is filtered, then 
mixed with 25% piperidine in DMF and shaken for 10 min. The solvent is removed, the resin washed with NMP (2X), 
MeOH (2X), and DMF (2X), and used directly in step C. 

Step C. The deprotected resin from step B is suspended in DMF and to it is added an FMOC-protected amino acid, 
containing amine functionality in its side chain (compound IV, e.g. alpha-N-FMOC-3-(3-pyridyi)-alanine, available from 
20 Synthetech. Albany, OR; 3 eq.), HATU (3 eq.), and DIEA (7.5 eq.) in DMF. The vessel is shaken for 1 hr. The solvent 
is removed and the resin washed with NMP (2X), MeOH (2X), and DMF (2X). The coupling of IV to the resin and the 
wash steps are repeated, to give compound V 

Step D. The rosin (compound V) is treated with piperidine as described in step B to remove the FMOC group. The 
deprotected resin is then divided equally by the ACT357 from the collection vessel into 16 reaction vessels. 

23 Step E. The 16 aliquots of deprotected resin from step D are suspended in DMF. To each reaction vessel is added the 
appropriate carboxylic acid Vl t . 16 (R^gCOoH; 3 eq.), HATU (3 eq ), and DIEA (7.5 eq.) in DMF. The vessels are shaken 
for 1 hr The solvent is removed and the aliquots of resin washed with NMP (2X), MeOH (2X), and DMF (2X). The 
coupling ol VI M6 to the aliquots of resin and the wash steps are repeated, to give compounds VII M5 . 
Step F. The aliquots of resin (compounds VII M6 ) are washed with CH 2 CI 2 (3X). To each of the reaction vessels is 

30 added 1% TFA in CH 2 CI 2 and the vessels shaken for 3G min. The solvent is filtered from the reaction vessels into 
individual tubes. The aliquots of resin are washed with CH 2 C! 2 (2X) and MeOH (2X) and the filtrates combined into the 
individual tubes. The individual tubes are evaporated in vacuo, providing compounds VIII M6 . 

Step G Each of the free carboxylic acids VII M6 is dissolved in DMF, To each solution is added pyridine (1.05 eq.), 
followed by pentafluorophenyl trifluoroacetate (1 .1 eq ) The mixtures are stirred for 45 min. at room temperature. The 
J$ solutions are diluted with EtOAc, washed with 1 M aq. citric acid (3X) and 5% aq. NaHCO G (3X), dried over Na^SO^ 
filtered, and evaporated in vacuo, providing compounds IX^q. 

8. Synthesis of Pentafluorophenyl Esters of Chemically Cleavable Mass Spectroscopy Tags, to Liberate Tags with 
Carboxyl Acid Termini 

[0265] Figure 2 shows the reaction scheme. 

Step A. 4-(Hydroxymethyl)phenoxybutyr!c acid (compound I; 1 eq.) is combined with DIEA (2.1 eq.) and allyl bromide 
(2.1 eq.) in CHCI 3 and heated to reflux for 2 hr. The mixture is diluted with EtOAc, washed with 1 N HCI (2X), pH 9.5 
carbonate buffer (2X), and brine (1 X), dried over Na 2 S0 4 , and evaporated in vacuo to give the ally! ester of compound I. 

45 Step B. The allyl ester of compound I from step A (1.75 eq.) is combined in CH 2 CI 2 with an FMOC-protected amino 
acid containing amine functionality in its side chain (compound IL e.g. alpha-N-FMOC-3-(3-pyridyl)-alanine, available 
from Synthetech, Albany, OR; 1 eq.), N-methylmorpholine (2.5 eq.), and HATU (1 . 1 eq.), and stirred at room temperature 
for 4 hr. The mixture is diluted with CH 2 CI 2 , washed with 1 M aq. citric acid (2X), water (1 X), and 5% aq. NaHC0 3 (2X), 
dried over Na 2 S0 4 , and evaporated in vacuo. Compound III is isolated by flash chromatography (CH 2 CI 2 -> EtOAc). 

50 Step C. Compound III is dissolved in CH 2 CI 2 , Pd(PPh 3 ) 4 (0.07 eq.) and N-methylaniline (2 eq.) are added ; and the 
mixture stirred at room temperature for 4 hr The mixture is diluted with CH 2 Ck washed with 1 M aq. citric acid (2X) 
and water (1X), dried over Na 2 S0 4 , and evaporated in vacuo. Compound IV is isolated by flash chromatography 
(CH 2 CI 2 -> EtOAc + HOAc). 

Step D. TentaGel S AC resin (compound V; 1 eq.) is suspended with DMF in the collection vessel of the ACT357 
ss peptide synthesizer (Advanced ChemTech Inc. (ACT), Louisville, KY). Compound IV (3 eq.), HATU (3 eq.) and DIEA 
(7.5 eq.) in DMF are added and the collection vessel shaken for 1 hr. The solvent is removed and the resin washed 
with NMP (2X), MeOH (2X). and DMF (2X). The coupling of IV to the resin and the wash steps are repeated, to give 
compound VI. 
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Step E. Tho rosin (compound VI) is mixed with 25% piperidino in DMF and shaken for 5 mm. The resin is filtered, then 
mixed with 25% piperidine in DMF and shaken for 10 min. The solvent is removed and the resin washed with NMP 
(2X), MeOH (2X), and DMF (2X). The deprotected resin is then divided equally by the ACT357 from the collection 
vessel into 1 6 reaction vessels. 

Step F. The 16 aliquots of deprotected resin from step E are suspended in DMF To each reaction vessel is added the 
appropriate carboxylic acid VII ^ _ te (R MS C0 2 H; 3 eq.), HATU (3 eq.), and DIEA (7.5 eq.) in DMF. The vessels are 
shaken for 1 nr. The solvent is removed and the aliquots of resin washed with NMP (2X), MeOH (2X), and DMF (2X). 
The coupling of VII,. 16 to the aliquots of resin and the wash steps are repeated, to give compounds Vill M6l . 
Step G. The aliquots of resin (compounds VIN^g) are washed with CH 2 CI 2 (3X). To each of the reaction vessels is 
added 1% TFA in CH 2 CI 2 and the vessels shaken for 30 min. The solvent is filtered from the reaction vessels into 
individual tubes. The aliquots of resin are washed with CH 2 CI 2 (2X) and MeOH (2X) and the filtrates combined into the 
individual tubes. The individual tubes are evaporated in vacuo, providing compounds IX M6 . 

Step H. Each of the free carboxylic acids 1X V16 is dissolved in DMF. To each solution is added pyridine (1.05 eq.), 
followed by pentafluorophenyl trifluoroacetate (1.1 eq.). The mixtures are stirred for 45 min. at room temperature. The 
solutions are diluted with EtOAc, washed with 1 M aq. citric acid (3X) and 5% aq. NaHCO 0 (3X), dried over Na 2 S0 4> 
filtered, and evaporated in vacuo, providing compounds X V15 . 

*\.im rr 



DEMONSTRATION OF PHOTO LYTIC CLEAVAGE OF T-L-X 

[0266] A T-L-X compound as prepared in Example 1 1 was irradiated with near-UV light for , mir at room temperature. 
A Rayonett fluorescence UV lamp (Southern New England Ultraviolet Co., Middletown. CT) with an emission peak at 
350 nrn is used as a source of UV light. The lamp is placed at a 15-crn distance from the Potri dishes with samples. 
SDS gei electrophoresis shows that >85% of the conjugate is cleaved under these conditions. 

EXAMPLE 3 

PREPARATION OF FLUORESCENT LABELED PRIMERS AND DEMONSTRATION OF CLEAVAGE OF 
FLUOROPHORE 

Synthesis and Purification of Oligonucleotides 

[0267] The oligonucleotides (ODNs) are prepared on automated DNA synthesizers using the standard phosphora- 
midite chemistry supplied by the vendor, or the H-phosphonate chemistry (Glenn Research Sterling, VA). Appropriately 
blocked dA. dG, dC, and T phosphoramidites are commercially available in these forms, and synthetic nucleosides 
may readily be converted to the appropriate form. The oligonucleotides are prepared using tho standard phosphora- 
midite supplied by the vendor, orthe H-phosphonate chemistry. Oligonucleotides are purified by adaptations of standard 
methods. Oligonucleotides with S'-trityl groups are chromatographed on HPLC using a 12 micrometer, 300 # Rainin 
(Emeryville, CA) Dynamax C-8 4.2x250 mm reverse phase column using a gradient of 15% to 55% MeCN in 0.1 N 
Et 3 NHOAc~, pH 7.0, over 20 min. When detritylation is performed, the oligonucleotides are further purified by gel 
exclusion chromatography. Analytical checks for the quality of the oligonucleotides are conducted with a PRP-column 
(Alltech, Deerfield, IL) at alkaline pH and by PAGE. 

[0268] Preparation of 2,4,6-trichlorotriazine derived oligonucleotides: 10 to 1000 u,g of 5'-terminal amine linked oli- 
gonucleotide are reacted with an excess recrystallized cyanuric chloride in 10% n-methyl-pyrrolidone in alkaline (pH 
8.3 to 8.5 preferably) buffer at 19°C to 25°C for 30 to 120 minutes. The final reaction conditions consist of 0.15 M 
sodium borate at pH 8.3, 2 mg/ml recrystallized cyanuric chloride and 500 ug/ml respective oligonucleotide. The un- 
reacted cyanuric chloride is removed by size exclusion chromatography on a G-50 Sephadex (Pharmacia, Piscataway, 
NJ) column. 

[0269] The activated purified oligonucleotide is then reacted with a 100-fold molar excess of cystamine in 0.15 M 
sodium borate at pH 8.3 for 1 hour at room temperature. The unreacted cystamine is removed by size exclusion chro- 
matography on a G-50 Sephadex column. The derived ODNs are then reacted with amine-reactive fluorochromes. 
The derived ODN preparation is divided into 3 portions and each portion is reacted with either (a) 20-fold molar excess 
of Texas Red sulfonyl chloride (Molecular Probes, Eugene ; OR), with (b) 20-fold molar excess of Lissamine sulfcnyl 
chloride (Molecular Probes, Eugene, OR), (c) 20-fold molar excess of fluorescein isothiocyanate. The final reaction 
conditions consist of 0.15 M sodium borate at pH 8.3 for 1 hour at room temperature. The unreacted fluorochromes 
are removed by size exclusion chromatography on a G-50 Sephadex column. 

[0270] To cleave the fluorochrome from the oligonucleotide, the ODNs are adjusted to 1 x 1 0r 5 molar and then dilutions 
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are made (12, 3-fold dilutions) in TE (TE is 0.01 M Tris, pH 7.0, 5 mM EDTA). To 100 u,l volumes of ODNs 25 uJ of 0,01 
M dithiothraitol (DTT) is added. To an identical set.of controls no DDT is added. The mixture is incubated for 15 minutes 
at room temperature. Fluorescence is measured in a black microtiter plate. The solution is removed from the incubation 
tubes (1 50 microliters) and placed in a black microtiter plate (Dynatek Laboratories, Chantilly, VA). The plates are then 
read directly using a Fluoroskan II fluorometer (Flow Laboratories, McLean, VA) using an excitation wavelength of 495 
nm and monitoring omission at 520 nm for fluorescein, using an excitation wavelength of 591 nm and monitoring 
emission at 61 2 nm for Texas Red, and using an excitation wavelength of 570 nm and monitoring emission at 590 nm 
for iissamine. 



w 



20 



Moles of Fluorochrome 


RFU non-cleaved 


RFU cleaved 


RFU free 


1.0x10 5 M 


6.4 


1200 


1345 


3.3 x 1Q6M 


2.4 


451 


456 


1.1 x 10 6 M 


0.9 


135 


130 


3.7 x 10?M 


0.3 


44 


48 


1.2x 10 7 M 


0.12 


15.3 


16.0 


4.1 x 10 7 M 


0.14 


4.9 


5.1 


1.4 x 10 a M 


0.13 


2.5 


2.8 


4.5 x 10 9 M 


0.12 


0.8 


0.9 



The data indicate that there is about a 200-fold increase in relative fluorescence when the fluorochrome is cleaved 
from the ODN. 



EXAMPLE 4 

PREPARATION OF TAGGED M13 SEQUENCE PRIMERS AND DEMONSTRATION OF CLEAVAGE OF TAGS 



[0271 J Preparation of 2,4,6-trichlorotriazino derived oligonucleotides; 1GG0u,g of S'-terminal ammo linked oligonu- 
cleotide (5 , -hoxylamine-TGTAAAACGACGGCCAGT-3 n ) (Seq. ID No. 1 ) are reacted with an excess recrystallized cy- 
anuric chloride in 10% n-meihyi-pyrroiidono aikaime (pH 8.3 to 8.5 preferably) buffer at 19 to 25- C for 30 to 120 
minutes. The final reaction conditions consist of 0.15 M sodium borate at pH 8 3, 2 mg/mi recrystallized cyanuric 
chloride and 500 ug/mi respective oligonucleotide. The unreacted cyanuric chloride is removed by size exclusion chro- 
matography on a G-50 Sephadex column 

[0272] The activated purified oligonucleotide is then reacted with a 100-fold molar excess of cystamino in 0 15 M 
sodium borate at pH 8.3 for 1 hour at room temperature. The unreacted cystamine is removed by size exclusion chro- 
matography on a G-50 Sephadex column. The Jo lived CCN;.; u * then ioueted ;v::h a vu;:ely oi ■ *mid : 'c. derive- 1 
ODN preparation is divided into 12 portions and each portion is reacted (25 molar excess) with the pentafluorophenyl- 
esters of either: (1 ) 4-methoxybenzoic acid, (2) 4-fluorobenzoic acid, (3) toluic acid, (4) benzoic acid, (5) indole-3-acetic 
acid. (6) 2,6-difluorobenzoic acid, (7) nicotinic acid N-oxide, (8) 2-nitrobenzoic acid, (9) 5-acetylsalicylic acid, (10) 
4-ethoxybenzoic acid, (11) cinnamic acid, (12) 3-aminonicotinic acid. The reaction is for 2 hours at 37 0 C in 0.2 M 
NaBorate pH 8.3. The derived ODNs are purified by gel exclusion chromatography on G-50 Sephadex. 
[0273] To cleave the tag from the oligonucleotide, the ODNs are adjusted to 1 x 10" 5 molar and then dilutions are 
made (12, 3-fold dilutions) in TE (TE is 0.01 M Tris, pH 7.0, 5 mM EDTA) with 50% EtOH (VA/). To 100 uJ volumes of 
ODNs 25 uJ of 0.01 M dithiothreitoi (DTT) is added. To an identical set of controls no DDT is added. Incubation is for 
30 minutes at room temperature. NaCI is then added to 0.1 M and 2 volumes of EtOH is added to precipitate the ODNs. 
The ODNs are removed from solution by centrifugation at 14,000 x G at 4°C for 15 minutes. The supernatants are 
reserved, dried to completeness. The pellet is then dissolved in 25 uJ MeOH. The pellet is then tested by mass spec- 
trometry for the presence of tags. 

[0274] The mass spectrometer used in this work is an external ion source Fourier-transform mass spectrometer 
(FTMS). Samples prepared for MALDI analysis are deposited on the tip of a direct probe and inserted into the ion 
source. When the sample is irradiated with a laser pulse, ions are extracted from the source and passed into a long 
quadrupole ion guide that focuses and transports them to an FTMS analyzer cell located inside the bore of a super- 
conducting magnet. 

[0275] The spectra yield the following information. Peaks varying in intensity from 25 to 100 relative intensity units 
at the following molecular weights: (1) 212.1 amu indicating 4-methoxybenzoic acid derivative, (2) 200.1 indicating 
4-fluorobenzoic acid derivative, (3) 196.1 amu indicating toluic acid derivative, (4) 182.1 amu indicating benzoic acid 
derivative, (5) 235.2 amu indicating indole-3-acetic acid derivative, (6) 218.1 amu indicating 2,6-difluorobenzoic deriv- 
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alive, (7)199.1 amu indicating nicotinic acid N-oxide derivative, (8) 227.1 amu indicating 2-nitrobenzamide, (9) 179.18 
amu indicating 5-acetylsalicylic acid derivative, (10) 226.1 amu indicating 4-ethoxy benzoic acid derivative, (11 )209.1 
amu indicating cinnamic acid derivative, (12)198.1 amu indicating 3-aminonicotinic acid derivative. 
[0276] The results indicate that the MW-identifiers are cleaved from the primers and are detectable by mass spec- 
s trometry. 

EXAMPLE 5 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA R 1 . 36 -LYS(e-iNIP)-ANP-TFP 

10 

[0277] Figure 3 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L h ), where L h is an activated 
ester (specifically tetrafluorophenyl ester), L 2 is an ortho-nitrobenzylamine group with L 3 being a methylene group that 
links L h and L 2 , T has a modular structure wherein the carboxylic acid group of lysine has been joined to the nitrogen 
atom of the L 2 benzylamine group to form an amide bond, and a variable weight component R^, (where these R 
is groups correspond to T 2 as defined herein, and may be introduced via any of the specific carboxylic acids listed herein) 
is bonded through the a-amino group of the lysine, while a mass spec sensitivity enhancer group (introduced via N- 
methylisonipecotic acid) is bonded through the {--amino group of the lysine. 
[0278] Referring to Figure 3: 

Step A. NovaSyn HMP Resin (available from NovaBiochern; 1 eq.) is suspended with DMF in the collection vessel of 

20 the ACT357. Compound I (ANP available from ACT; 3 eq.), HATU (3 eq.) and NMM (7.5 eq.) in DMF are added and 
the collection vessel shaken for 1 hr. The solvent is removed and the rosin washed with NMP (2X), MeOH (2X), and 
DMF (2X). The coupling of I to the resin and the wash steps are repeated, to give compound II. 
Step B. The resin (compound II) is mixed with 25% piperidine in DMF and shaken for 5 mm. The resin is filtered, then 
mixed with 25% piperidine in DMF and shaken for 10 mm. The solvent is removed, the resin washed with NMP (2X), 

2S MeOH (2X), and DMF (2X), and used directly in step C. 

Step C. The doprotected resin from step B is suspended in DMF and to it is added an FMOC-protected amino acid, 
containing a protected amine functionality in its side chain (Fmoc-Lysine(Aloc)-OK available from PerSeptive Biosys- 
tems; 3 eq ), HATU (3 eq.), and NMM (7.5 eq.) in DMF Tho vessel is shaken for 1 hr. The solvent is removed and the 
resin washed with NMP (2X), MeOH (2X), and DMF (2X). The coupling of Fmoc-Lys(Aloc)-OH to the resin and the 

3Q wash steps are repeated, to give compound IV. 

Step D. The rosin (compound IV) is washed with CH 2 Cl 2 (2X), and then suspended in a solution of (PPh 3 ) 4 Pd (0) (0.3 
eq.) and PhSiH 3 (10 eq.) in CH 2 CI 2 . The mixture is shaken for 1 hr. The solvent is removed and the resin is washed 
with CH 2 CI 2 (2X). The palladium step is repeated. The solvent is removed and the resin is washed with CH 2 CI 2 (2X), 
N,N-diisopropylethylammonium diethyldithiocarbamate in DMF (2X), DMF (2X) to give compound V. 

35 Step E. The deprotected resin from step D is coupled with N-methylisonipecotic acid as described in step C to give 
compound VI. 

Step F. The Fmoc protected resin VI is divided equally by the ACT357 from the collection vessel into 36 reaction vessels 
to give compounds Vl,_ 36 . w 

Step G. The resin (compounds Vl^g) is treated with piperidine as described in step 3 to remove tho FMOC group. 

40 Step H. The 36 aliquots of deprotected resin from step G are suspended in DMF To each reaction vessel is added the 
appropriate carboxylic acid (R^aeCOgH; 3 eq.), HATU (3 eq.), and NMM (7.5 eq.) in DMF. The vessels are shaken for 
1 hr. The solvent is removed and the aliquots of resin washed with NMP (2X), MeOH (2X), and DMF (2X). The coupling 
of R-t-36^^2^ t0 tr1e a '' c l uo ts of resin and the wash steps are repeated, to give compounds VIII 1-36 . 
Step I. The aliquots of resin (compounds Vlll^) are washed with CH 2 CI 2 (3X). To each of the reaction vessels is 

45 added 90:5:5 TFA:H20:CH 2 C! 2 and the vessels shaken for 120 mm. The solvent is filtered from the reaction vessels 
into individual tubes. The aliquots of resin are washed with CH 2 CI 2 (2X) and MeOH (2X) and the filtrates combined 
into the individual tubes. The individual tubes are evaporated in vacuo, providing compounds IX 1 . 36 . 
Step J. Each of the free carboxylic acids JX 1-36 is dissolved in DMF. To each solution is added pyridine (1.05 eq.), 
followed by tetrafluorophenyl trifluoroacetate (1.1 eq.). The mixtures are stirred for 45 min. at room temperature. The 

so solutions are diluted with EtOAc, washed with 5% aq. NaHC0 3 (3X), dried over Na 2 S0 4 , filtered, and evaporated in 
vacuo, providing compounds X 1 . 36 . 

EXAMPLE 6 

55 PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA R, . 36 -LYS(e-INIP)-NBA-TFP 



[0279] Figure 4 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L^); where L h is an activated 

ester (specifically, tetrafluorophenyl ester), L 2 is an ortho-nitrobenzylamine group with L 1 being a direct bond between 
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L h and L 2 , where L h is joined directly to the aromatic ring of the L 2 group, T has a modular structure wherein the 
carboxylic acid group of lysine has been joined to the nitrogen atom of the L 2 benzylamine group to form an amide 
bond, and a variable weight component R-).^, (where these R groups correspond to T 2 as defined herein, and may 
be introduced via any of the specific carboxylic acids listed herein) is bonded through the a-amino group of the lysine, 
while a mass spec enhancer group (introduced via N-methylisonipecctic acid) is bonded through the e-ammo group of 
the lysine. 

[0280] Referring to Figure 4 

Stop A. NovaSyn HMP Resin is coupled with compound I (NBA prepared according to the procedure of Brown et a!. ( 
Molecular Diversity, 1 , 4 (1 995)) according to the procedure described in step A of Example 5, to give compound II. 
Steps B-J. The resin (compound II) is treated as described in steps B-J of Example 5 to give compounds X^g. 

EXAMPLE 7 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA INIP-LYS (e-R^J-ANP-TFP 

[0281] Figure 5 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L h ), where L h is an activated 
ester (specifically, tetrafluorophenyl ester),. L 2 is an ortho-nitrobenzylamino group with L 3 being a methylene group that 
links L h and L 2 , T has a modular structure wherein the carboxylic acid group of lysine has been joined to the nitrogen 
atom of the L 2 benzylamine group to form an amide bond, and a variable weight component R t _36» (where these R 
groups correspond to T 2 as defined herein, and may be introduced via any oi the specific carboxylic acids listed herein) 
is bonded through the e-amino group of the lysine, while a mass spec sensitivity enhancer group (introduced via N- 
methylisonipecotic acid) is bonded through the a-ammo group of the lysine. 
[0282] Referring to Figure 5: 
Steps A-C. Same as in Example 5 

Step D. The resin (compound IV) is treated with pipendine as described in step B of Example 5 to romovo the FMOC 
group. 

Step E. The deprotected a-amine on the resin in step D is coupled with N-methyiisonipecotic acid as described in step 
C of Example 5 to give compound V. 
Step F. Same as in Example 5. 

Step G. The resin (compounds V!,. ;]6 ) are treated with palladium as described in step D of Example 5 to remove the 
Aloe group. 

Steps H-J The compounds X,. 36 are prepared in the same manner as in Example 5. 
EXAMPLE 8 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA R 1-36 -GLU(y-DIAEA)-ANP-rFP 

[0283] Figure 6 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L h ), where L h is an activated 
ester (specifically tetrafluorophenyl ester) L 2 is an ortho-nitrobenzylamine group with L 3 being a methylene group that 
links L h and L 2 , T has a modular structure wherein the a-carboxyiic acid group of glutamatic acid has been joined to 
the nitrogen atom of the L 2 benzylamine grci jp to form an amide bond, and a variable weight component fi-,. 36l (where 
these R groups correspond to T 2 as detined herein, and may be introduced via any of the specific carboxylic acids 
listed herein) is bonded through the aa-amino group of the glutamic acid, while a mass spec sensitivity enhancer group 
(introduced via 2-(diisoprcpylamino)ethylamine) is bonded through the 7-carboxylic acid of the glutamic acid. 
[0284] Referring to Figure 6: 
Steps A-B. Same as in Example 5. 

Step C. The deprotected resin (compound III) is coupled to Fmoc-Glu-(OAI)-OH using the coupling method described 
in step C of Example 5 to give compound IV. 

Step D. The aNyl ester on the resin (compound IV) is washed with CH 2 CI 2 (2X) and mixed with a solution of (PPh 3 ) 4 Pd 
(0) (0.3 eq.) and N-methy!aniline (3 eq.) in CH 2 CI 2 . The mixture is shaken for 1 hr. The solvent is removed and the 
resin is washed with CH 2 CI 2 (2X). The palladium step is repeated. The solvent is removed and the resin is washed 
with CH 2 CI 2 (2X), N,N-diisopropylethylammonium diethyldithiocarbamate in DMF (2X), DMF (2X) to give compound V 
Step E. The deprotected resin from step D is suspended in DMF and activated by mixing HATU (3 eq.), and NMM (7.5 
eq.). The vessels are shaken for 15 minutes. The solvent is removed and the resin washed with NMP (1X). The resin 
is mixed with 2-(diisopropylamino)ethyiamine (3 eq.) and NMM (7.5 eq.). The vessels are shaken for 1 hour. The 
coupling of 2-(diisopropyiamino)ethy!arnine to the resin and the wash steps are repeated, to give compound VI. 
Steps F-J. Same as in Example 5. 
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EXAMPLE 9 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA Rp 36 -LYS(e-INIP)-ANP-LYS(e-NH 2 )-NH 2 

[0285] Figure 7 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L h ), where L h is an amine 
(specifically, the e-amino group of a lysine-derived moiety), L 2 is an ortho-nitrobenzylamme group with L 3 being a 
carboxamido-substituted alkyleneaminoacylalkylene group that links L h and L 2 , T has a modular structure wherein the 
carboxylic acid group of lysine has been joined to the nitrogen atom of the L 2 benzylamine group to form an amide 
bond, and a variable weight component R^e, (where these R groups correspond to T 2 as defined herein, and may 
be introduced via any of the specific carboxylic acids listed herein) is bonded through the a-amino group of the lysine, 
while a mass spec sensitivity enhancer group (introduced via N-methylisonipecotic acid) is bonded through the E- 
amino group of the lysine. 
[0286] Referring to Figure 7: 

Step A. Fmoc-Lys(Boc)-SRAM Resin (available from ACT; compound I) is mixed with 25% piperidine in DMF and 

shaken for 5 min. The resin is filtered, then mixed with 25% piperidine in DMF and shaken for 1 0 min. The solvent is 

removed, the resin washed with NMP (2X), MeOH (2X), and DMF (2X), and used directly in step B. 

Step B . The resin (compound II), ANP (available tram ACT; 3 eq.), HATU (3 eq.) and NMM (7.5 eq.) in DMF are added 

and the collection vessel shaken for 1 hr. The solvent is removed and the resin washed with NMP (2X), MeOH (2X), 

and DMF (2X) The coupling of I to the resin and the wash steps are repeated, to give compound Hi. 

Steps C-J. The resin (compound III) is treated as in steps B-l in Example 5 to give compounds X 1 . 36 . 

EXAMPLE 10 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA R,. 36 -LYS(e-TFA)-LYS(e -IINP)-ANP-TFP 

[0287] Figure 8 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = L h ), where L h is an activated 
ester (specifically, tetrafluorophenyl ester) : L 2 is an ortho-nitrobenzylamme group with L 3 being a methylene group that 
links L h and L 2 , T has a modular structure wherein the carboxylic acid group of a first lysine has been joined to the 
nitrogen atom o1 the L 2 benzylamine group to form an amide bond, a mass spec sensitivity enhancer group (introduced 
via N-methylisonipecotic acid) is bonded through the e-amino group of the first lysine, a second lysine molecle has 
been joined to the first lysine through the a-amino group of the first lysine, a molecular weight adjuster group (having 
a trifluoroacetyl structure) is bonded through the e-amino group of the second lysine, and a variable weight component 
^i-36» (where these R groups correspond to T 2 as defined herein, and may be introduced via any of the specific car- 
boxylic acids listed herein) is bonded through the a-amino group of the second lysine. 
[0288] Referring to Figure 3. 

Steps A-E. These steps are identical to steps A-E in Example 5. 

Step F . The resin (compound VI) is treated with piperidine as described in step B in Example 5 to remove the FMOC 
group. 

Step G. The deprotected resin (compound VII) is coupled to Fmcc-Lys(Tfa)-OH using the coupling method described 
in step C of Example 5 to give compound VIII. 

Steps H-K, The resin (compound VIII) is treated as in steps F-J in Example 5 to give compounds X\ v36 . 
EXAMPLE 11 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA . 36 -LYS(e-!NIP)-ANP-5'-AH-ODN 

[0289] Figure 9 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = MOI, where MOI is a nucleic 
acid fragment, ODN) derived from the esters of Example 5 (the same procedure could be used with other T-L-X com- 
pounds wherein X is an activated ester). The MOI is conjugated to T-L through the 5' end of the MOI. via a phosphodi- 
ester - alkyleneamine group. 
[0290] Referring to Figure 9: 

Step A. Compounds XII-,_ 36 are prepared according to a modified biotinylation procedure in Van Ness et al. t Nucleic 
Acids Res., 19, 3345 (1991). To a solution of one of the 5'-aminohexyl oligonucleotides (compounds X\ 1 . 36 , 1 mg) in 
200 mM sodium borate (pH 8.3, 250 mL) is added one of the Tetrafluorophenyl esters (compounds X u36 from Example 
A, 100-fold molar excess in 250 mLof NMP). The reaction is incubated overnight at ambient temperature. The unreacted 
and hydrolyzed tetrafluorophenyl esters are removed from the compounds XM-,. 36 by Sephadex G-50 chromatography. 
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EXAMPLE 12 

PREPARATION OF A SET OF COMPOUNDS OF THE FORMULA R 1 . 36 -LYS(e-iNIP)-ANP-LYS(e-(MCT-5 , -AH-ODN))- 
NH 2 

[0291] Figure 10 illustrates the parallel synthesis of a set of 36 T-L-X compounds (X = MOI, where MOI is a nucleic 
acid fragment, ODN) derived from the amines of Example 11 (the same procedure could be used with other T-L-X 
compounds wherein X is an amine). The MOI is conjugated to T-L through the 5' end of the MOI, via a phosphodiester 
- alkyleneamine group. 
[0292] Referring to Figure 10: 

Step A. The 5'-[6-(4,6-dichloro-1 ,3,5-triazin-2-ylamino)hexyl]oligonucleotides Xll^g are prepared as described in Van 
Ness et al., Nucleic Acids Res., 19, 3345 (1991). 

Step B. To a solution of one of the 5 , -[6-(4,6-dichioro-1 ,3,5-triazin-2-ylamino)hexyl]oligonucieotides (compounds 
XI 1^36) at a concentration of 1 mg/ml in 100 mM sodium borate (pH 8.3) was added a 100-fold molar excess of a 
primary amine selected from R 1 . 36 -Lys(e-iNIP)-ANP-Lys(e-NH 2 )-NH 2 (compounds X^g from Example 11). The solu- 
tion is mixed overnight at ambient temperature. The unreacted amine is removed by ultrafiltration through a 3000 MW 
cutoff membrane (Amicon, Beverly, MA) using H 2 0 as the wash solution (3 X). The compounds XIII 1<J6 are isolated 
by reduction of the volume to 100 mL 

EXAMPLE .3 

DEMONSTRATION OF THE SIMULTANEOUS DETECTION OF MULTIPLE TAGS BY MASS SPECTROMETRY 

[0293] This example provides a description of the ability to simultaneously detect multiple compounds (tags) by mass 
spectrometry. In this particular example, 31 compounds are mixed with a matrix, deposited and dried on to a solid 
support and thon desorbed with a laser. The resultant ions are then introduced in a mass spectrometer. 
[0294] The following compounds (purchased from Aldnch. Milwaukee, Wl) are mixed together on an equal molar 
basis to a final concentration of 0.002 M (en a per compound) basis: benzamido (121.14), nicotinamide (122.13), 
pyrazinamide (123.12), 3-amino-4-pyrazolecarboxylic acid (127.10) : 2-thiophenecarboxamide (127.17), 4-amtnoben- 
zamide (135.15), tolumide (135.17), 6-methytnicotinamide (136.15), 3-aminonicotinamide (137.14), nicotinamide N- 
oxido (138.12), 3-hydropicolinamide (138.13), 4-fluorobenzamide (139.13), c in namamide (147.18), 4-methoxybenza- 
mide (151.17), 2,6-difluorbenzamide (157.12), 4-amino-5-imidazole-carboxyamide (162.58), 3,4-pyridine-dicarboxya- 
mide (165.16), 4-ethoxybenzamide (165.19), 2,3-pyrazinedicarboxamide (166.14), 2-nitrobenzamide (166.14), 
3-fluoro-4-methoxybenzoic acid (170.4), indole-3-acetamide (174.2). 5-acetylsalicylamide (179.18), 3,5-dimethoxy- 
benzamide (181.1 9) : 1 -naphthaleneacetamide (1 85.23), 8-chloro-3,5-diamino-2-pyrazinecarboxyamide (187.59), 4-tri- 
fluoromethyl-benzamide (189.00), 5-amino-5-phenyl-4-pyrazole-carboxamide (202.22), 1 -methyl -2-benzy I -malona- 
mate (207.33), 4-amino-2,3,5.6-tetrafiuorobenzamide (208.11), 2,3-napthlenedicarboxyIic acid (212.22). The com- 
pounds are placed in DMSO at the concentration described above. One pj of the material is then mixed with alpha- 
cyano-4-hydrcxy cinnamic acid matrix (after a 1:10,000 dilution) and deposited on to a solid stainless steel support. 
[0295] The material is then desorbed by a laser using the Protein TOP Mass Spectrometer (Bruker, Manning Park, 
MA) and the resulting ions are measured in both the iinear and reflectron modes of operation. The following m/z values 
are observed (Figure 11): 



121.1 -> 


benzamide (121.14) 


122.1-* 


nicotinamide (122.13) 


123.1-* 


pyrazinamide (123.12) 


124.1 




125.2 




127.3-> 


3-amino-4-pyrazolecarboxylic acid (127.10) 


127.2-* 


2-thiophenecarboxamide (127.17) 


135.1-* 


4-aminobenzamide (135.15) 


135.1-4 


tolumide(135.17) 


136.2-4 


6-methylnicotinamide ( 1 36. 1 5) 


137.1-> 


3-aminonicotinamide (137.14) 


138,2~> 


nicotinamide N-oxide (138.12) 


138.2-4 


3-hydropicolinamide (138.13) 
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(continued^ 



1 > 


4-fluorobenzamidQ (139.13) 








(^innamamiHo / 1 A~7 1 Q \ 

ciririarTiarnicjs { i*r/. 






1 49.2 


4-methoxybenzamide (151.17) 


1 52.2 


2.6~difluorbenzamide (157.12) 


158.3 


4-amino-5-imidazole-carboxyamidG (162.58) 


163.3 




165.2-* 


3 : 4-pyridine-dicarboxyamide (165.16) 


165. 2-* 


4-ethoxybenzamide (165.19) 


166. 2-> 


2, 3-pyrazinedicarboxamide (1 66. 1 4) 


166.2-* 


2-nitrobenzamide (166.14) 




3-fluoro-4-methoxybenzoic acid (170.4) 


171 .1 




1 72.2 




1 73.4 


inaole-3-acetamide (174.2) 


178.3 




179.3 -> 


5-acetylsaIicylamide (179.18) 


1 81 .2 — > 


3,5-dimethoxybenzamide (181.19) 


132.2 -> 


l-naphthalonGactMamide (185.23) 


136.2 


3-chloro-3 t 5-d!amino-2-pyrazinGcarboxyamidc (187 59) 


1 88.2 




189.2--* 


4-trifluoromethyl-benzamide (1 89.00) 


190.2 




191.2 




1 92.3 


5-amino-5-phenyl-4-pyrazole-carboxamide (202.22) 


203.2 




203.4 


1 -mQthvl-2-hAn7\y!-ma)onarn^tA (907 ICW 4-Rminn-P *3 ^ fi-f Atraflijnrnhfin/amiHf* f208 111 

I ii io u i y i c vjt7HZ.yi 1 1 IQIUI 1 cm Halt? \c^J / . sJ*J j *-r cii i 1 1 1 I w c , vJ, o , vj lOllallUUIUUOi itai i i iuo ^t-uu. i i f ^ 


212.2-* 


2,3-napthlenadicarboxylic acid (212.22). 


219.3 




221.2 




228.2 




234.2 




237.4 




241.4 





[0296] The data indicate that 22 of 31 compounds appeared in the spectrum with the anticipated mass, 9 of 31 
compounds appeared in the spectrum with an + H mass (1 atomic mass unit, amu) over the anticipated mass. The 
latter phenomenon is probably due to the protonation of an amine within the compounds. Therefore 31 of 31 compounds 
are detected by MALDI Mass Spectroscopy. More importantly, the example demonstrates that multiple tags can be 
detected simultaneously by a spectroscopic method. 

[0297] The alpha-cyano matrix alone (Figure 11 ) gave peaks at 146.2, 164.1, 172.1, 173.1, 189.1, 190.1, 191.1, 
192.1, 212.1 , 224.1, 228.0, 234.3. Other identified masses in the spectrum are due to contaminants in the purchased 
compounds as no effort was made to further purify the compounds. 

EXAMPLE 14 

MICROS ATE LLITE MARKERS: PCR AMPLIFICATIONS. 

[0298] The microsatellite markers are amplified utilizing the following standard PCR conditions. Briefly, PCR reactions 
are performed in a total volume of 50 uj, containing 40 ng of genomic DNA, 50 pmol of each primer, 0. 125 mM dNTPs 
and 1 unit of Tag polymerase. 1X amplification buffer contains 10 mM Tris base, pH 9, 50 mM KCI, 1.5 mM MgCI 2 , 
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0.1% Triton X-100 and 0.01% gelatin. The reactions are performed using a "hot-start" procedure: Taq polymerase is 
added only after a first denaturation step of 5 minutes at 96° C. Amplification is carried out for 35 cycles: denaturation 
(94°C for 40 sec) and annealing (55 9 C for 30 sec). An elongation step (72°C for 2 minutes) ends the process after the 
last annealing. Since the amplification products to be obtained are short (90 to 350 base pairs long) and the time 
interval to raise the temperature from 55°C to 94°C (obtained with a ramping rate ol 1 0 C/second) is long enough, 
completion of DNA elongation can be achieved without a step at 72°C. 



Claims 

1. A method for determining the presence of a nucleic acid molecule, comprising: 

(a) generating tagged nucleic acid molecules from one or more selected target nucleic acid molecules, wherein 
a tag is correlative with a particular nucleic acid fragment and detectable by spectrometry or potentiometry; 

(b) separating said tagged molecules by size; 

(c) cleaving said tag from said tagged molecule; and 

(d) detecting said tag by spectrometry or potenticmetry, and therefrom determining the presence of said nucleic 
acid molecule. 

2. A method for detecting a selected nucleic acid molecule, comprising: 

(a) combining a tagged nucleic acid probe with target nucleic acid molecules under conditions and for a time 
sufficient to permit hybridization of said tagged nucleic acid probe to a complementary selected target nucleic 
acid sequence, wherein said tagged nucleic acid probe comprises a tag that is correlative with a particular 
fragment and is detectable by spectrometry or potentiometry; 

(b) altering the size of said hybridized tagged probes, the size of unhybridized probes or target molecules, or 
the size of the probe: target hybrids; 

(c) separating the tagged probes by size; 

(d) cleaving said tag from said tagged prooe; and 

(e) detecting said tag by spectrometry or potentiometry, and therefrom detecting said selected nucleic acid 
molecule 

3. The method according to any one of claims 1-2 wherein the detection of the tag is by mass spectrometry infrared 
spectrometry, ultraviolet spectrometry, or, potentiostatic amperometry. 

4. The method according to any one of claims t -3 wherein greater than 4 tagged nucleic acid fragments are generated 
and wherein each tag is unique for a selected nucleic acid fragment. 

5. The method according to any one of claims 2-3 wherein greater than 4 tagged nucleic probes are utilized and 
wherein each tag is unique for a selected nucleic acid probe. 

6. The method according to any one of claims 1 -5 wherein said target nucleic acid molecule is generated by primer 

extension. 

7. The method according to any one of claims 2-5 wherein the size of safd hybridized tagged probes, unhybridized 
probes or target molecules, or probe.target hybrids are altered by a method selected from the group consisting of 
polymerase extension, ligation, exonuclease digestion and endonuclease digestion. 

8. The method according to any one of claims 1 -7 wherein said tagged molecules are separated by a method selected 
from the group consisting of gel electrophoresis, capillary electrophoresis, micro-channel electrophoresis, HPLC, 
size exclusion chromatography and filtration. 

9. The method according to any one of claims 1 -8 wherein said tagged molecules are cleaved by a method selected 
from the group consisting of oxidation, reduction, acid-labile, base labile, enzymatic, electrochemical, thermal and 
photolabile methods. 

10. The method according to any one of claims 1 -9 wherein said tag is detected by time-of-flight mass spectrometry, 
quadrupole mass spectrometry, magnetic sector mass spectrometry and electric sector mass spectrometry. 
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1 1 . The method according to any one of claims 1 -9 wherein said tag is detected by potentiostatic amperometry utilizing 
detectors selected from the group consisting of coulometrlc detectors and amperometric detectors. 

12. The method according to any one of claims 1-11 wherein the steps of separating, cleaving and detecting are 
performed in a continuous manner. 

13. The method according to any one of claims 1-12 wherein the steps separating, cleaving and detecting are per- 
formed in a continuous manner on a single device. 

14. The method according to any one of claims 1-13 wherein the steps of separating, cleaving and detecting are 
automated. 

15. The method according to any one of claims 1-14 wherein said tagged molecules or probes are generated from 
non 3'-tagged oligonucleotide primers. 

16. The method according to any one of claims 1-14 wherein said tagged molecules or probes are generated from 
tagged dideoxynucleotide terminators. 

17. The method according to any one of claims 1-2, 4-9 and 12-16 wherein said tag is detected by non-fluorescent 
spectrometry or potentiometry. 

18. A method for genotyping a selected organism, comprising: 

fa) generating tagged nucleic acid molecules from a selected target molecule wherein a lag is correlative with 
a particular fragment and may be detected by spectrometry or potentiometry; 
(b) separating said tagged molecules by size; 

fc) cleaving said tag ft cm said tagged molecule, and 

fd) detecting said tag by spectrometry or potentiometry. and therefrom determining the genotype of said or- 
ganism. 

19. A method for genotyping a selected organism, comprising: 

(a) combining a tagged nucleic acid molecule with a selected target molecule under conditions and for a time 
sufficient to permit hybridization of said tagged molecule to said target molecule, wherein a tag is correlative 
with a particular fragment and may be detected by spectrometry or potentiometry; 

(b) separating said tagged molecules by size; 

(c) cleaving said tag from said tagged molecule, and 

(d) detecting said tag by spectrometry or potontiometry, and therefrom determining the genotype of said or- 
ganism. 

20. The method according to claims 18 or 19 wherein said tagged molecules are generated from a pool of clones 
selected from the group consisting of genomic clones, cDNA clones and RNA clones. 

21. The method according to claims 18 or 19 wherein said tagged molecules are generated by polymerase chain 
reaction. 

22. The method according to claims 18 or 1 9 wherein said target nucleic acid molecule is generated by primer exten- 
sion. 

23. The method according to any one of claims 1 8-22 wherein the detection of the tag is by mass spectrometry, infrared 
spectrometry, ultraviolet spectrometry, or, potentiostatic amperometry. 

24. The method according to any one of claims 18-23 wherein greater than 4 tagged nucleic acid molecules are gen- 
erated and wherein each tag is unique for a selected nucleic acid fragment. 

25. The method according to any one of claims 18-24 wherein said tagged molecules are separated by a method 
selected from the group consisting of gel electrophoresis, capillary electrophoresis, micro-channel electrophoresis, 
HPLC, size exclusion chromatography and filtration. 
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26. The method according to any one of claims 1 8-25 wherein said tagged molecules are cleaved by a method selected 
from the group consisting of oxidation, reduction, acid-labile, base labile, enzymatic, electrochemical, thermal and 
photolabile methods. 

27. The method according to any one of claims 1 8-26 wherein said tag is detected by timeof-flight mass spectrometry, 
quadrupoie mass spectrometry, magnetic sector mass spectrometry and electric sector mass spectrometry. 

28. Tho method according to any one of claims 18-26 wherein said tag is detected by potentiostatic amperomotry 
utilizing detectors selected from the group consisting of coulometric detectors and amperometnc detectors. 

29. The method according to any one of claims 18-28 wherein the steps of separating, cleaving and detecting are 
performed in a continuous manner. 

30. Tho mothod according to any one of claims 18-29 wherein the steps of separating, cleaving and detecting are 
performed in a continuous manner on a single device. 

31. The method according to any one of claims 18-30 wherein the steps of separating, cleaving and detecting are 
automated. 

32. Tho method according to any one of claims 18-31 wherein said tagged molecules are generated from non-3'- 
tagged oligonucleotide primers. 

33. The method according to any one of claims 18-31 wherein said tagged molecules are generated from tagged 
dideoxynucleotide terminators 

34. Tho method according to any one of claims 1 -33 wherein said target molecule is obtained from a biological sample. 

35. Tho mothod according to any one of claims 18-34 wherein said tag is detected by non-fluorescent spectrometry 
or potentiometry. 

36. A composition comprising a plurality of compounds of the formula: 



wherein, 

T ms is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 

fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine; 

L is an organic group which allows a unique T m3 -containing moiety to be cleaved from the remainder of the 

compound, wherein the T ms -contaimng moiety comprises a functional group which supports a single ionized 

charge state when the compound is subjected to mass spectrometry and is selected from tertiary amine, 

quaternary amine and organic acid; 

MO! (molecule of interest) is a nucleic acid fragment; and 

wherein at least two compounds have the same T m5 but the MO I groups of those molecules have non-identical 
nucleotide lengths, wherein the MOI fragments do not terminate with the same dideoxynucleotide selected 
from ddAMR ddCMP, ddGMP and ddTMP. 

37. A composition comprising a plurality of compounds of the formula: 



T -L-MOI 



wherein, 

T ms is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 

fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine; 

L is an organic group which allows a unique T m9 -containing moiety to be cleaved from the remainder of the 
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compound, wherein the T m »-containing moiety comprises a functional group which supports a single ionized 
charge state when the compound is subjected to mass spectrometry and is selected from tertiary amine, 
quaternary amine and organic acid; 
MOI is a nucleic acid fragment; and 

wherein at least two compounds have the same P™* but those compounds have non-identical elution times 
as measured by HPLC using a column of non-porous 2.3 fim poiy(styrene-divinyibonzene)-C ia particles, and 
aqueous buffer comprising 100 mM triethylammonium acetate. 

38. A composition comprising a plurality of compounds of the formula: 



wherein, 

T ma is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 

fluonde, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine; 

L is an organic group which allows a unique T m3 -containing moiety to be cleaved from the remainder of the 

compound, wherein the T m3 -containing moiety comprises a functional group which supports a single ionized 

charge state when the compound is subjected to mass spectrometry and is selected from tertiary amine, 

quaternary amine and organic acid; 

MO! is a nucleic acid fragment; and 

wherein no two compounds which have the same MOI nucleotide length also have the same T™ 3 . 

39. A composition according to any one of claims 36-38 wherein the plurality is greater than 2. 

40. A composition according to any one of claims 3u-36 wherein the plurality is greater than 4. 

41. A composition according to any one of claim 36-40 wherein the nucleic acid fragment has a sequence comple- 
mentary to a portion of a vector, wherein the fragment is capable of priming polynucleotide synthesis. 

42. A composition according to any one of claim 36-41 wherein the T m3 groups of members of the plurality differ by 
at least 2 amu. 

43. A composition according to any one of claims 36-41 wherein the T ms groups of members of the plurality differ by 
at least 4 amu. 

44. A composition comprising a plurality of sets of compounds, each set of compounds having the formula: 

T ms -L-MO! 

wherein, 

T ms is an organic group detectable by mass spectrometry comprising carbon, at least one of hydrogen and 

fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine: 

L is an organic group which allows a unique T™ 3 -containing moiety to be cleaved from the remainder of the 

compound, wherein the T ms -containing moiety comprises a functional group which supports a single ionized 

charge state when the compound is subjected to mass spectrometry and is selected from tertiary amine, 

quaternary amine and organic acid; 

MOI is a nucleic acid fragment; and 

members within a first set of compounds have identical T ms groups, however have nan-identical MOI groups 
with differing numbers of nucleotides in the MOI and there are at least ten members within the first set, wherein 
between sets, the T ms groups differ by at least 2 amu. 

45. A composition according to claim 44 wherein the plurality is at least 3. 

46. A composition according to claim 44 wherein the plurality is at least 5. 
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47. A composition comprising a plurality of sets of compounds, each set of compounds having the formula 

T m9 -L-MOI 

wherein, 

T m3 is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 

fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine: 

L is an organic group which allows a unique T nns -containing moiety to be cleaved from the remainder of the 

compound, wherein the T ms -containing moiety comprises a functional group which supports a single ionized 

charge state when the compound is subjected to mass spectrometry and is selected from tertiary amine, 

quaternary amine and organic acid; 

MO I is a nucleic acid fragment; and 

wherein compounds within a set have non-identical T ms groups but the same elution times as measured by 
HPLC using a column of non-porous 2.3 pm poly(styrene-divinyibenzene)-C 18 particles, and aqueous buffer 
comprising 100 mM triethylammonium acetate. 

48. A kit for genotyping, comprising a plurality of amplification primer pairs, wherein at least one of the primers has 
the formula: 

T ma -L-MOI 

wherein, 

T m;i is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atcms selected from oxygen, nitrogen, sulfur, phosphorus and iodine: 
L is an organic group which allows a unique T mo -containing moiety to be cleaved from the remainder of the 
compound, wherein the T m3 -containmg moiety comprises a functional group which supports a single ionized 
charge state when the compound is subjected to mass spectrometry and is selected from tertiary amine, 
quaternary amine and organic acid; and 

MOI is a nucleic acid fragment; and each primer pair associates with a different loci. 

49. A kit according to claim 48 wherein the plurality is at least 3. 

50. A kit according to claim 48 wherein the plurality is at least 5. 



Patentanspruche 

1. Verfahren zur Bestimmung des Vorhandenseins eines Nukleinsauremolekuls, welches umfaOt: 

a) Erzeugen markierter Nukleinsauremolekule aus einem oder mehreren ausgewahlten Target-Nukleinsaure- 
molekulen, wobei eine Markierung mit einem bestimmten Nukleinsaurefragment korrelativ und durch Spek- 
trometrie oder Potentiometrie nachweisbar ist; 

b) Abtrennen besagter markierter Molekule nach Grofle; 

c) Abspalten besagter Markierung von besagtem markiertem Molekul; und 

d) Nachweisen besagter Markierung durch Spektrometrie oder Potentiometrie und Bestimmen des Vorhan- 
denseins besagten Nukleinsauremolekuls daraus. 

2. Verfahren zum Nachweisen eines ausgewahlten Nukleinsauremolekuls, welches umfaf3t: 

a) Kombinieren eines markierten Nukleinsauresonde mit Target-Nukteinsauremofekuien unter Bedingungen 
und fur einen Zeitraum, die ausreichend sind, um Hybridisierung besagter markierter Nukleinsauresonde an 
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aine komplementare ausgewahlte Target-Nukleinsauresequenz zu ermoglichen, wobei besagte markierte Nu- 
kleinsauresonde eine Markierung umfaftt, die mit einem bestimmten Fragment korrelativ ist und durch Spek- 
trometrio oder Potantiometrie nachweisbar ist; 

b) Verandem dar GroGe besagtar hybridisierter markiertar Sonden, der GroGe nicht-hybridisierter Sonden 
odor Targot-Molekule odar der GroGe dor Sonden:Target-Hybride; 

c) Abtrannon der markierten Sonden nach GroGe; 

d) Abspalten besagtar Markierung von besagtar markierter Sonde; und 

a) Nachweisen besagtar Markierung durch Spektrometria oder Potentiometrie und Nachwoisen besagten aus- 
gewahlten Nukleinsauremoiekuis daraus. 

3. Verfahren nach einam der Anspruche 1-2, dadurch gekennzeichnet, daG der Nachweis der Markierung durch 
Massenspektrornetrie, Infrarotspektrometrie, Ultraviolettspektrometrie oder potentiostatische Amperometrie er- 

folgt. 

4. Verfahren nach einem dar Anspruche 1-3, dadurch gekennzeichnet, daG mehr ais 4 markierte Nukieinsaurefrag- 
mente orzeugt werden und daG jade Markierung einzigartig fur am ausgewahltes Nukleinsaurefragment ist. 

5. Vertahren nach einem der Anspruche 2-3, dadurch gekennzeichnet, daG mehr als 4 markierte Nukleinsonden 
verwendet werden und daG jede Markierung einzigartig fur eine ausgewahlte Nukleinsauresonde ist. 

6. Verfahren nach einem der Anspruche 1-5, dadurch gekennzeichnet, daG besagtes Target-Nukleinsauremolekul 
durch Primer-Extension erzeugt wird. 

7. Verfahren nach einem der Anspruche 2-5, dadurch gekennzeichnet, daG die GroGe besagter hybridisierter mar- 
kiertar Sonden, nicht-hybridisierter Sonden oder Target-Molekule oder Sonden:Target-Hybride mit einem Verfah- 
ren verandert wird, das ausgewahlt ist aus der Gruppe, die aus Polymerase-Extension, Ligation, Exonuklease- 
Verdauung und Endonuklease-Verdauung besteht. 

3. Verfahren nach einem der Anspruche 1-7, dadurch gekennzeichnet, daG besagte markierte Moiekule mit einem 

Verfahren abgetrcnnt werden, das ausgewahlt ist aus der Gruppe, die aus Gelelektrophcrese, Kapillarelektropho- 
rese, Mikrokanaielektrophorese, HPLC, AusschluGchromatographie und Filtration besteht. 

9. Verfahren nach einem der Anspruche 1-8, dadurch gekennzeichnet, daG besagte markierte Moiekule mit einem 
Verfahren gespalten werden, das ausgewahlt ist aus der Gruppe, die aus Oxidations-, Reduktions-, saurelabilen, 
baseniabilen, enzymatischen, elektrochemischen, thermischen und fotolabilen Verfahren besteht. 

1 0. Verfahren nach einem der Anspruche 1 -9, dadurch gekennzeichnet, daG besagte Markierung durch Flugzeit-Mas- 
senspektrometrie, Quadrupol-Massenspektrometrie, Magnetsektor-Massenspektrometrie und Elektrosektor-Mas- 
senspektrometrie nachgewiesen wird. 

11. Verfahren nach einem der Anspruche 1-9, dadurch gekennzeichnet, daG besagte Markierung durch potentiosta- 
tische Amperometrie unter Verwendung von Detektoren nachgewiesen wird, die ausgewahlt sind aus der Gruppe, 
die aus coulometrischen Detektoren und amperometrischen Detektoren besteht. 

12. Verfahren nach einem der Anspruche 1-11, dadurch gekennzeichnet, daG die Schritte Abtrennen, Spalten und 
Nachweisen in einer kontinuierlichen Art und Weise durchgefuhrt werden. 

13. Verfahren nach einem der Anspruche 1-12, dadurch gekennzeichnet, daG die Schritte Abtrennen, Spalten und 
Nachweisen in ainer kontinuierlichen Art und Weise auf einer einzigen Vorrichtung durchgefuhrt werden. 

14. Verfahren nach einem der Anspruche 1-13, dadurch gekennzeichnet, daG die Schritte Abtrennen, Spalten und 
Nachweisen automatisiert sind. 

15. Verfahren nach einem der AnsprQche 1 -1 4, dadurch gekennzeichnet, daG besagte markierte Moiekule oder Son- 



EP 0 840 804 B1 



den aus nicht-3'-markierten Oiigonukleotidprimern erzeugt warden. 

16. Verfahren nach einem der Anspruche 1-14, dadurch gekennzeichnet, daft besagte markierte MoiekQIe Oder Son- 
■ den aus markierten Didesoxynukleotidterminatoren erzeugt warden. 

5 

17. Verfahren nach einem der Anspruche 1-2, 4-9 und 12-16, dadurch gekennzeichnet, daB besagta Markierung durch 
Nicht-Fluoreszenz-Spektrometne oder Potentiometrie nachgewiesen wird. 

18. Verfahren zum Ganotypisieren eines ausgewahlten Organismus, welches umfaBt: 

to 

a) Erzeugen markierter Nukieinsauremolekule aus einem ausgewahlten Target-Moiekul, wobei eine Markie- 
rung mit einem bestimmten Fragment korrelativ ist und durch Spektrcmetrie oder Potentiometrie nachgewie- 
sen warden kann; 

75 b) Abtrennen besagter markierter Molekule nach GroBe; 

c) Abspalten besagter Markierung von besagtem markierten Molekul; und 

d) Nachweisen besagter Markierung durch Spektrometrie oder Potentiometrie und Bestimmen des Genotyps 
20 besagten Organismus daraus. 

19. Verfahren zum Ganotypisieren eines ausgewahlten Organismus, welches umfaBt: 

a) Kombinieren eines markierten Nukleinsauremolekuls mit einem ausgewahlten Target-Moiekul unter Bedin- 
-5 gungen und fur einen Zeitraum, die aus r eichend sind, urn Hybridisierung besagten markierten Molekiils an 

bcsagtes Target-Moiekul zu ermdglichen, wobei eine Markierung mit einem bestimmten Fragment korrelativ 
ist und durch Spektrometrie oder Potentiometrie nachgewiesen werden kann; 

b) Abtrennen besagten markierten Molekuls nach GroBe; 

30 

c) Abspalten besagter Markierung von besagtem markierten Molekul; und 

d) Nachweisen besagter Markierung durch Spektrometrie oder Potentiometrie und Bestimmen des Genotyps 
besagten Organismus daraus. 

35 

20. Verfahren nach Anspruch 18 oder 19, dadurch gekennzeichnet, daB besagte markierte Molekule aus einem Pool 
von Klonen erzeugt werden, ausgewahlt aus der Gruppe, die aus genomischen Klonen, cDNA-Klonen und RNA- 
Klonen besteht. 

*o 21. Verfahren nach Anspruchen 18 oder 19, dadurch gekennzeichnet, daft besagte markierte Molekule durch Poly- 
merasekettenreaktion erzeugt werden. 

22. Verfahren nach Anspruch 18 oder 19, dadurch gekennzeichnet, daB besagtes Target-Nukleinsauremolekul durch 
Primer-Extension erzeugt wird. 

45 

23. Verfahren nach Anspruchen 13-22, dadurch gekennzeichnet, daB der Nachweis der Markierung durch Massen- 
spektrometrie, Infrarotspektrometrie, Ultraviolettspektrometrie oder potentiostatische Amperometrie erfolgt 

24. Verfahren nach einem der Anspruche 18-23, dadurch gekennzeichnet, daft mehr als 4 markierte Nukletnsaure- 
50 molekule erzeugt werden und daB jede Markierung einzigartig fur ein ausgewahltes Nukleinsaurefragment ist. 

25. Verfahren nach einem der Anspruche 18-24, dadurch gekennzeichnet, daft besagte markierte Molekule mit einem 
Verfahren abgetrennt werden, das ausgewahlt ist aus der Gruppe, die aus Gelelektrophorese, Kapiflarelektropho- 
rese, Mikrokanalelektrophorese, HPLC, AusschluBchromatographie und Filtration besteht. 

35 

26. Verfahren nach einem der Anspruche 18-25, dadurch gekennzeichnet, daft besagte markierte Molekule mit einem 
Verfahren gespalten werden, das ausgewahlt ist aus der Gruppe, die aus Oxidations-, Reduktions-, saurelabilen, 
baselabilen, enzymatischen, elektrochemischen, thermischen und fotoiabilen Vertahren besteht. 
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27. Verfahren nach einem der Anspruche 18-26, dadurch gekennzeichnet, daB Markierung durch Flugzeit-Massen- 
spektrometria, Quadrupol-Massanspektrometrie, Magnetsektor-Massenspektrometrie und Elektrosektor-Massen- 
spaktrornetrie nachgewiesen wird. 

5 28. Verfahren nach ainem der Anspruche 18-26, dadurch gekennzeichnet, daft besagte Markierung durch patentio- 
statische Amperometrie unter Verwendung von Detektoren nachgewiesen wird, die ausgewahit sind aus der Grup- 
pe, die aus coulometrischen Detektoren und amperometrischen Detektoren besteht. 

29. Verfahren nach einem der Anspruche 18-28, dadurch gekennzeichnet, daB die Schritte Abtrennen, Spalten und 
10 Nachweisen in einer kontinuierlichen Art und Weise durchgefuhrt werden. 

30. Verfahren nach einem der Anspruche 18-29, dadurch gekennzeichnet, daB die Schritte Abtrennen, Spalten und 
Nachweisen in einer kontinuierlichen Art und Weise auf einer einzigen Vorrichtung durchgefuhrt werden. 

is 31. Verfahren nach einem der Anspruche 18-30, dadurch gekennzeichnet, daB die Schritte Abtrennen, Spalten und 
Nachweisen automatisiert sind. 

32. Verfahren nach einem der Anspruche 18-31, dadurch gekennzeichnet, daB besagte markierte Molekuie aus nicht- 
3'-markierten Oligonukleotidprimern erzeugt warden. 



33. Verfahren nach einem der Anspruche 18-31, dadurch gekennzeichnet, daB besagte markierte Molekuie aus mar- 
kierten Didesoxynukieotidteminatoren erzeugt werden. 

34. Verfahren nach einem der Anspruche 1 -33, dadurch gekennzeichnet, daB besagtes Target-Molekul aus einer bio- 



35. Verfahren nach einem der Anspruche 13-34, dadurch gekennzeichnet, daB besagte Markierung durch Nicht-Fluo- 

reszenz-Spektrometrio oder Potentiometrie nachgewiesen wird 

20 36. Zusammonsetzung, die mehrere Verbindungen der Formel: 



20 



25 



logischen Probe erhalten wird. 



T ms -L-MOf 



35 



umfaBt, in der 



T m3 eine durch Massenspektrometrie nachweisbare organische Gruppe ist, die Kohlenstoff, wenigstens eines 
von Wasserstoff und Ruorid und fakultative Atome umfaOt, die ausgewahit sind aus Sauerstoff, Stickstoff, 
Schwefel, Phosphor und lod; 



40 



L eine organische Gruppe ist, die ermogiicht, eine einzigartige T m3 -enthaltende Einheit vom Rest der Verbin- 
dung abzuspalten, wobei die T m3 -enthaltende Einheit eine funktionelle Gruppe urmfaBt, die einen einzigen 
ionisierten Ladungszustand unterstutzt, wenn die Verbindung Massenspektrometrie unterworfen wird, und 
ausgewahit ist aus tertiarem Amin, quartarem Amin und organischer Saure; 



45 



MOI (interessierendes Molekul) ein Nukleinsaurefragment ist; und 



50 



wobei wenigstens zwei Verbindungen dieseibe T ms aufweisea aber die MOI-Gruppen dieser Molekuie nicht- 
identische Nukleotidlangen haban, wobei die MOI-Fragmente nicht mit demselben wie Didesoxynukleotid en- 
den, das ausgewahit ist aus ddAMP, ddCMP, ddGMP und ddTMP. 



37, Zusammensetzung, die etne Mehrzahl von Verbindungen der Formel: 



55 



T -L-MO! 



urnfaBt, in der 
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T m * eine durch Massenspektrometrie nachweisbare organische Gruppe ist, die Kohienstoff, wenigstens eines 
von Wassarstoff und Fluorid und fakultative Atoms umfaGt, die ausgewahlt 3ind aus Sauerstoff, Stickstoff, 
Schwefel, Phosphor und lod; 

s L aine organische Gruppe ist, die ermoglicht, eine oinzigartige T m3 -enthaltende Einheit vom Rest der Verbin- 

dung abzuspalten, wobei die T m3 -enthaitende Einheit eine funktionella Gruppe umfaBt, die einen einzigen 
ionisierten Ladungszustand unterstutzt, wenn die Verbindung Massenspektrometrie unterworfen wird, und 
ausgewahlt ist aus tertiarem Amin, quartarem Am in und organischer Saure; 

10 MO I ein Nukleinsaurefragment ist; und 

wobei wenigstens zwei Verbindungen dieselbe T ms aufweisen, aber diese Verbindungen nicht-identische Elu- 
tionszeiten haben, gemessen mit HPLC unter Verwendung einer Saule aus nicht-pordsen 2,3 pm groOen Poly 
(styrol-divinylbenzoi)-C 18 -Teilchen und waBrigan Puffers, der 100 mM Triethylammoniumacetat umfaGt. 

75 

38. Zusammensetzung, die eine Mehrzahl von Verbindungen der Formel: 

T m3 -L-MOI 

20 

umfaGt, in dor 

T ms eine durch Massenspektrometrie nachweisbare organische Gruppe ist, die Kohienstoff,. wenigstens eines 
von Wasserstoff und Fluorid und fakultative Atomo umfaBt, die ausgewahlt sind aus Sauerstoff, Stickstoff, 
2S Schwefel, Phosphor und lod; 

L eine organische Gruppe ist, die ermoglicht, eine oinzigartige "P^-enthaltenda Einheit vom Rest der Verbin- 
dung abzuspalten, wobei die T ms -entha!tende Einheit eine funktionelle Gruppe umfaGt, die einen einzigen 
ionisierten Ladungszustand unterstutzt, wenn die Verbindung Massenspektrometrie unterworfen wird, und 
ausgewahlt ist aus tertiarem Amin, quartarem Amin und organischer Saure; 

MOI ein Nukleinsaurefragment ist; und 

wobei nicht zwei Verbindungen, die dieselbe MOI-Nukleotidlange aufweisen, auch dasselbe T ms aufweisen. 

35 

39. Eine Zusammensetzung nach einem der Anspruche 36-38, dadurch gekennzeichnet, daf3 die Mehrzahl mehr als 
2 ist. 

40. Zusammensetzung nach einem der Anspruche 36-38, dadurch gekennzeichnet, daB die Mehrzahl mehr als 4 ist. 

41. Zusammensetzung nach einem der Anspruche 36-40, dadurch gekennzeichnet, daO das Nukleinsaurefragment 
eine zu einem Abschnitt eines Vektors komplementare Sequenz aufweist, wobei das Fragment zum Priming der 
Polynukleotidsynthese in der Lage ist. 

45 42. Zusammensetzung nach einem der Anspruche 36-41, dadurch gekennzeichnet, daB die T ms -Gruppen von Mit- 
gliedern der Mehrzahl sich urn wenigstens 2 amu unterscheiden. 

43. Zusammensetzung nach einem der Anspruche 36-41, dadurch gekennzeichnet, daf3 die T ms -Gruppen von Mit- 
gliedern der Mehrzahl sich urn wenigstens 4 amu unterscheiden. 

50 

44. Zusammensetzung, die eine Mehrzahl von Satzen von Verbindungen umfaBt, wobei jeder Satz von Verbindungen 
die Formel: 

55 T ms -L-MOI 



hat, in der 
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T ma eine durch Massenspektrometrie nachweisbare organische Gruppe ist, die Kohlenstoff, wenigstens eines 
von Wasserstoff und Fluorid und fakuitative Atome umfaBt, die ausgewahit sind aus Sauerstoff, Stickstoff, 
Schwefel, Phosphor und lod; 

L eine organische Gruppe ist, die ermoglicht, eine einzigartige T ms -enthaltende Einheit vom Rest der Verbin- 
dung abzuspalten, wobei die T m3 -enthaltende Einheit eine funktionelle Gruppe umfaBt, die einen einzigen 
ionisierten Ladungszustand unterstutzt, wenn die Verbindung Massenspektrometrie unterworfen wird, und 
ausgewahit ist aus tertiarem Arnin, quartarem Amin und organischer Saure; 

MO I ein Nukleinsaurefragment ist; und 

Mitglieder in einem ersten Satz von Verbindungen identische T ms -Gruppen haben, jedoch nicht-identische 
MOI-Gruppen mit unterschiedlichen Zahlen von Nukieotiden im MOI haben und es wenigstens zehn Mitglieder 
im ersten Satz gibt, wobei zwischen Satzen die T ms -Gruppen sich um wenigstens 2 amu unterscheiden. 

45. Zusammensetzung nach Anspruch 44, wobei die Mehrzahl wenigstens 3 ist. 

46. Zusammensetzung nach Anspruch 44, wobei die Mehrzahl wenigstens 5 ist. 

47. Zusammensetzung, die eine Mehrzahl von Satzen von Verbindungen umfaOt, wobei jeder Satz von Verbindungen 
die Formel: 



hat, in der 

jms 0!nQ rjurch Massenspektrometrio nachweisbare organische Gruppe ist, die Kohlenstoff, wenigstens einos 
von Wasserstoff und Fluorid und fakuitative Atome umfaBt, die ausgewahit sind aus Sauerstoff, Stickstoff, 
Schwefel, Phosphor und iod; 

L eine organische Gruppe ist, die ermoglicht, eine einzigartige T m3 -enthaltende Einheit vom Rest der Verbin- 
dung abzuspalten, wobei die T^-enthaitende Einheit eine funktionelle Gruppe umfaBt, die einen einzigen 
ionisierten Ladungszustand unterstutzt, wenn die Verbindung Massenspektrometrie unterworfen wird, und 
ausgewahit ist aus tertiarem Amin, quartarem Amin und organischer Saure; 

MOI ein Nukleinsaurefragment ist; und 

wobei Verbindungen in einem Satz nicht-identische T m3 -Gruppen aufweisen, aber diese dieselben Elutions- 
zeiten haben, gemessen mit HPLC unter Verwendung einer Saule aus nicht-porosen 2,3 jim groBen Poly 
(styrol-divinyibenzo!)-C ia -Teifchen und waBrigen Puffers, der 100 mM Trtethylammoniumacetat umfaBt. 

48. Kit zum Genotypisieren, der eine Mehrzahl von Amplifikationsprimerpaaren umfaBt, wobei wenigstens einer der 
Primer die Formel hat: 



T -L-MO! 

in der 

T ms eine durch Massenspektrometrie nachweisbare organische-Gruppe ist, die Kohlenstoff, wenigstens eines 
von Wasserstoff und Fluorid und fakuitative Atome umfaBt, die ausgewahit sind aus Sauerstoff, Stickstoff, 
Schwefel, Phosphor und lod; 

L eine organische Gruppe ist, die ermoglicht, eine einzigartige T m3 -enthaltende Einheit vom Rest der Verbin- 
dung abzuspalten, wobei die T ms -enthaitende Einheit eine funktionelle Gruppe umfaBt, die einen einzigen 
ionisierten Ladungszustand unterstutzt, wenn die Verbindung Massenspektrometrie unterworfen wird, und 
ausgewahit ist aus lerttarem Amin, quartarem Amin und organischer Saure; 
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MO I ein Nukleinsaurefragment ist; und 

jedes Primarpaar mit einem unterschiedlichen Locus assoziiort. 

49. Kit nach Anspruch 48, dadurch gekennzetchnet, daG die Mehrzahl wenigstens 3 ist. 

50. Kit nach Anspruch 48, dadurch gekennzeichnot, da3 die Mehrzahl wenigstens 5 ist. 



Revendications 

1. Procede pour determiner la presence d'une molecule d'acide nucleique, comprenant les stapes consistant : 

(a) a engendrer des molecules d'acide nucleique marquees a partir d'une ou plusieurs molecules d'acide 
nucleique cibles choisies, dans lesquelles un marqueur presente une correlation avec un fragment d'acide 
nucleique particulier et est detectable par spectrometrie ou potentiometrie ; 

(h) a separor lesdites molecules marquees selon ios dimGnsicns ; 

(c) a diver ledit marqueur de ladite molecule marquee ; et 

(d) a detecter ledit marqueur par spectrometrie ou potentiometrie et, d'apres cette detection, a determiner la 
presence de ladite molecule d'acide nucleique. 

2. Procede pour detecter une molecule d'acide nucleique choisie, comprenant les etapes consistant : 

(a) a associer une sonde d'acide nucleique marquee avec des. molecules d'acide nucleique cibles dans des 
conditions et pendant un temps suffisant pour permottre ('hybridation de ladite sonde d'acide nucleique mar- 
quee a une sequence d'acide nucleique cible choisie cornplementaire, ladite sonde d'acide nucleique marquee 
comprenant un marqueur qui presente une correlation avec un fragment particulier et qui est detectable par 
spectrometrie ou potentiometrie ; 

(b) a modifier les dimensions desdites sondes marquees hybridees, les dimensions des sondes non hybridees 
ou des molecules cibles ou les dimensions des hybrides sondes de point cible ; 

(c) a separer les sondes marquees suivant les dimensions ; 

(d) a diver ledit marqueur de ladite sonde marquee ; 

(e) a detecter ledit marqueur par spectrometrie ou potentiometrie, et, d'apres cette detection, a detecter ladite 
molecule d'acide nucleique choisie. 

3. Procede suivant Tune quelconque des revendications 1 et 2, dans lequel la detection du marqueur est effectuee 
par spectrometrie de masse, spectrometrie infrarouge, spectrometrie ultraviolette ou amperometrie potentiostati- 
que. 

4. Procede suivant Tune quelconque des revendications 1 a 3, dans lequel plus de quatre fragments d'acide nucleique 
marques sont engendres et chaque marqueur est particulier a un fragment d'acide nucleique choisi. 

5. Procede suivant Tune quelconque des revendications 2 et 3, dans iequel plus de quatre sondes d'acide nucleique 
marquees sont utilisees et chaque marqueur est particulier a une sonde d'acide nucleique choisie. 

6. Procede suivant Tune quelconque des revendications 1 a 5, dans (equel la molecule d'acide nucleique cible est 
engendree par extension d'amorce. 

7. Procede suivant Tune quelconque des revendications 2 a 5, dans lequel les dimensions des sondes marquees 
hybridees, des sondes non hybridees ou des molecules cibles ou bien des hybrides sonde:cible sont modifiees 
par un procede choisi dans le groupe consistant en une extension avec une polymerase, une ligation, une digestion 
avec une exonuclease et une digestion avec une endonuclease. 

8. Procede suivant Tune quelconque des revendications 1 a 7. dans lequel les molecules marquees sont separees 
par un procede choisi dans le groupe consistant en I'electrophorase sur gel, I'electrophorese capillaire, I'electro- 
phorese dans des microcanaux, la CLHP, la chromatographie d'exciusion dimensionnelle et la filtration. 

9. Procede suivant Tune quelconque des revendications 1 a 8, dans lequel les particules marquees sont clivees par 
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un precede choisi dans le groupe consistant en das precedes d'oxydation, de reduction, de labilite en milieu acide, 
de labilite en milieu basique, des proced^s enzymatiques, des precedes electrochimiques, des precedes thermi- 
ques et des process de photolabilite\ 

10. Precede suivant Tune quelconque des revendications 1 a 9, dans iequel le marqueur est detecte par spectrometrie 
de masse a temps de vol, spectrometrie do masse quadripolaire. spectrometrie de masse a secteur magnetique 
at spectrometrie do masse a secteur electriquo. 

1 1 . Precede suivant Tune quelconque des revendications 1 a 9, dans Iequel le marqueur est detecto par amperometrie 
potentiostatique en utilisant des detecteurs choisis dans le groupe consistant en des detecteurs coulometriques 
ot des detecteurs amperometriques. 

12. Procede suivant Tune quelconque des revendications 1 a 11 , dans Iequel les stapes de separation, de clivage et 
de detection sont effectuees de maniere continue. 

13. Procede suivant Tune quelconque des revendications 1 a 12, dans Iequel les etapes de separation, de clivage et 
de detection sont effectuees de maniere continue sur un dispositif unique. 

14. Proc&ci& suivant Tune quelconque des revendications 1 a 13, dans Iequel les etapes de separation, de clivage et 
de detection sont automatisees. 

1 5. Procede suivant Tune quelconque des revendications 1 a 1 4, dans Iequel les molecules ou sondes marquees sont 
engendrees a partir d'amorces oligonucieotidiques non marquees en position 3\ 

1 6. Procede suivant Tune quelconque des revendications 1 a 1 4, dans Iequel les molecules ou sondes marquees sont 
engendrees a partir de terrninateurs didesoxynucleotidiques marques. 

17. Procede suivant I'une quelconque des revendications 1-2, 4-9 ot 12-16, dans Iequel le marqueur est detecto par 
spectrometrie non fluoroescente ou potentiometrie. 

18. Procede pour le genotypage d'un organisme choisi, comprenant les etapes consistant : 

(a) a engendrer des molecules d'acide nucleique marquees a partir d'une molecule cible choisie, un marqueur 
presentant une correlation avec un fragment particular et pouvant etre detecte par spectrometrie ou 
potentiometrie ; 

(b) a separer lesditos molecules marquees suivant !os dimensions ; 

(c) a diver ledit marqueur de iadite molecule marquee ; 

(d) a detecter ledit marqueur par spectrometrie ou potentiometrie, et, d'apres cette detection, a determiner le 
genotype de cet organisme. 

19. Procede pour le genotypage d'un organisme choisi, comprenant les etapes consistant : 

(a) a associer une molecule d'acide nucleique marquee a une molecule cible choisie dans des conditions et 
pendant un temps suffisant pour permettre ('hybridation de Iadite molecule marquee a Iadite molecule cible, 
un marqueur presentant une correlation avec un fragment particulier et pouvant etre detecte par spectrometrie 
ou potentiometrie ; 

(b) a separer lesdites molecules marquees suivant les dimensions ; 

(c) a diver ledit marqueur de Iadite molecule marquee ; et 

(d) a detecter ledit marqueur par spectrometrie ou potentiometrie et, d'apres cette detection, a determiner le 
genotype dudit organisme. 

20. Procede suivant la revendication 18 ou 19, dans Iequel les molecules marquees sont engendrees a partir d'un 
ensemble de clones choisis dans le groupe consistant en des clones genomiques, des clones d'ADNc et des 
clones d'ARN. 

21. Procede suivant la revendication 18 ou 19, dans Iequel les molecules marquees sont engendrees par reaction en 
chains avec une polymerase. 
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22. Precede suivant la revendication 18 ou 19, dans Iequel la molecule d'acide nucleique cjble est ongendrde par 
extension d'amorce. 

23. Procede suivant i'une quelconque des revendications 18 a 22, dans lequei la detection du marqueur est effectuee 
par spectrometry de masse, spectrometrie infrarouge, spectrometrie ultravioiette ou amperomotrio potentiostati- 
que. 

24. Precede suivant I'uno quelconque des revendications 18 a 23, dans Iequel plus do quatre molecules d'acide nu- 
cleique marquees sont engendrSes et chaque marqueur est particulier a un fragment d'acide nucleique choisi. 

25. Procede suivant Tune quelconque des revendications 18 a 24, dans iequel les molecules marquees sont separees 
par un procede choisi dans le groupe consistant en I'electrophorese sur gel, I'electrophorese capilfaire, I'electro- 
phorese dans des microcanaux, la CLHP, la chromatography d'exclusion dimensionnello et la filtration. 

26. Procede" suivant I'une quelconque des revendications 18 a 25, dans Iequel les molecules marquees sont clivees 
par un procede choisi dans le groupe consistant en des procedes d'oxydation, de reduction, de lability en milieu 

acide. de labilite en milieu basique, des precedes onzymatiques. des precedes electrochimiques, des procedes 
(hermiques et des procedes de photolabilite. 

27. Procede suivant I'une quelconque des revendications 18 a 26, dans Iequel le marqueur est d6tect3 par spectro- 
metrie de masse a temps de vol, spectrometrie de masse quadripolaire, spectrometrie de masse a secteur ma- 

rjnotiquo et spectrometrie de masse a secteur electrique. 

28. Procede suivant I'une quelconque des revendications 18 a 26, dans Iequel le marqueur est detecte par ampero- 
mbtrie potentiostatique en utilisant des detecteurs choisis dans le groupe consistant en des detecteurs coulome- 
tnques et des detecteurs amperometnques 

29. Procede suivant I'une quelconque des revendications 18 a 28, dans Iequel les etapes de separation, de clivage 
ot do detection sent mises en oeuvre de mani&re continue. 

30. Procede suivant I'une quelconque des revendications 18 a 29, dans iequel les etapes de separation, de clivage 
et de detection sont mises en oeuvre de maniere continue sur un dispositif unique. 

31. Procede suivant I'une quelconque des revendications 18 a 30, dans Iequel les etapes de separation, de clivage 
et de detection sont automatisees; 

32. Procede suivant Tune quelconque des revendications 13 a 31, dans Iequel les molecules marquees sont engen- 
drees a partir d'amorces oligonucleotidiques non marquees en position 3'. 

33. Procede" suivant I'une quelconque des revendications 18 a 31, dans Iequel les molecules marquees sont engen- 
drees a partir de terminateurs didesoxynucleotidiques marques. 

34. Procede suivant I'une quelconque des revendications 1 a 33, dans Iequel la molecule cible est obtenue a partir 
d'un echantillon biologique. 

35. Procede suivant Tune quelconque des revendications 18 a 34, dans Iequel le marqueur est detecte par spectro- 
metrie non fluorescente ou potentiometrie. 

36. Composition comprenant une pluralite de composes de formule : 

f^-L-MOl 

dans laquelle, 

T™ 3 represente un groupe organique detectable par spectrometrie de masse, comprenant du carbone, au 
moins un des elements consistant en hydrogene et fluor, et des atomes facultatifs choisis entre I'oxygene, 
Pazote, le soufre, le phosphors et Node ; 
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L represente un groupe organique qui permet a un groupement particulier contenant T™* d'etre clivd du reste 
du compose, le groupement contenant T m « comprenant un groupe fonctionnel qui entretient un etat de charge 
ionis6 unique lorsque le compose est soumis a une spectrometrie de masse et est choisi entre une amine 
tertiaire, une amine quaternaire et un acide organique ; 
MO! (molecule interessante) represente un fragment d'acide nucleique ; et 

au moins deux composes ont la meme valeur de T m3 mais les groupes MOI de ces molecules ont des longueurs 
de sequence de nucleotides non identiques, les fragments MOI ne se terminant pas par le meme did^soxy- 
nucleotide choisi entre ddAMR ddCMR ddGMP et ddTMP. 

37. Composition comprenant une piuralite de composes de formule : 

T m3 -L-MOI 

dans laquelle, 

T ms represente un groupe organique detectable par spectrometrie de masse, comprenant du carbone, au 
moins un des elements consistant en hydrogene et fluor, et des atomes facultatifs choisis entre I'oxygene, 
I'azote, le soufre, le phosphore et I'iode ; 

L represente un groupe organique qui permet a un groupement particulier contenant T ms d'etre ciive du reste 
du compose, le groupement contenant T ms comprenant un groupe fonctionnel qui entretient un etat de charge 
ionise unique lorsque !e compose est soumis a une spectrometrie de masse et est choisi entre une amine 
tertiaire, une amine quaternaire et un acide organique ; 
MO! represente un fragment d'acide nucleique ; et 

au rnoins deux composes ont la meme valeur de T ma mais ces composes ont des temps d'elution non-iden- 
tiques, mesures par CLHP en utilisant une colonne de particules non poreuses de poiy(styrene-divinylbenze- 

ne)-C li3 de 2,3 u.m ot un tampon aqueux comprenant de I'acetate de triethylammonium 100 rnM. 

38. Composition comprenant une piuralite de composes de formule : 



dans laquelle, 

T ms represente un groupe organique detectable par spectrometrie de masse, comprenant du carbone, au 
moins un des elements consistant en hydrogene et fluor, et des atomes facultatifs choisis entre I'oxygene, 
I'azote, le soufre, le phosphore et I'iode ; 

L represente un groupe organique qui permet a un groupement particulier contenant T m3 d'etre ciive du reste 
du compose, le groupement contenant T miS comprenant un groupe lonctionnel qui entretient un etat de charge 
ionise unique lorsque le compose est soumis a une spectrometrie de masse et est choisi entre une amine 
tertiaire, une amine quaternaire et un acide organique ; 
MOI represente un fragment d'acide nucleique ; et 

il n'existe pas deux composes ayant ia meme longueur de sequence de nucleotides de MOI qui ont egalement 
la meme valeur de T ms . 

39. Composition suivant I'une quelconque des revendications 36 a 38, dans laquelle ia piuralite correspond a une 
valeur superieure a 2. 

40. Composition suivant I'une quelconque des revendications 36 a 38, dans laquelle la piuralite correspond a une 
valeur superieure a 4. 

41. Composition suivant Tune quelconque des revendications 36 a 40, dans laquelle le fragment d'acide nucleique a 
une sequence complementaire d'une partie d'un vecteur, ce fragment etant apte a f'amorcage de la synthese de 
polynucleotides. 



42. Composition suivant I'une quelconque des revendications 36 a 41, dans laquelle les groupes T ms des membres 
de ia piuralite different d'une valeur d'au moins 2 amu. 
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43. Composition suivant Tune quelconque des revendications 36 a 41, dans laquelle les groupes T™* des membres 
da la plurality different d'une valeur d'au moins 4 amu. 

44. Composition cornprenant une plurality de series da composes, chaque sane de composes repondant a la formula : 



dans laquelle, 

T ms represente un groupe organique detectable par spectrometrie de masse, cornprenant du carbone, au 
moins un des elements consistant en hydrogene et fluor, et des atomes facultatifs choisis entre I'oxygene, 
i'azote, le soufre, le phosphore et I'iode ; 

L represente un groupe organique qui permet a un groupement particulier contenant T 015 d'etre ciive du reste 
du compose, le groupement contenant T m5 cornprenant un groupe fonctionnel qui entretient un etat de charge 
ionise unique lorsque le compose est soumis a une spectrometrie de masse et est choisi entre une amine 
tortiairo, une amine quatemaire et un acide organique ; 
MOI represente un fragment d'acide nucleique ; et 

les membres d'une premiere serie de composes ont des groupes T ms identiques et ont cependant des groupes 
MOI non identiques avec des nombres differents de nucleotides dans le groupe MOI et il existe au moins dix 
membres dans la premiere serie, les groupes T m3 diffe>ant d'une valeur d'au moins 2 amu entra les series. 

45. Composition suivant la revendication 44 ; dans laquelle la pluralite correspond a une valeur d'au moins 3. 

46. Composition suivant la revendication 44 ; dans laquelle la pluralite correspond a une valeur d'au moins 5. 

47. Composition cornprenant une pluralite de series do composes, chaque serie de composes repondant a la formule 



dans laquelle, 

T mr - represente un groupe organique detectable par spectrometrie de masso, cornprenant du carbone, au 
moins un des elements consistant en hydrogene et fluor, et des atomes facultatifs choisis entre i'oxygene, 
I'azote, le soufre, le phosphore et I'iode ; 

L represente un groupe organique qui permet a un groupement particulier contenant T ms d'etre clive du reste 
du compose, le groupement contenant T ms cornprenant un groupe fonctionnel qui entretient un etat de charge 
ionise unique lorsque le compose est soumis a une spectrometrie de masse et est choisi entre une amine 
tertiaire, une amine quaternaire et un acide organique ; 
MOI represente un fragment d'acide nucleique ; et 

les composes dans une serie ont des groupes T ms non identiques mais ont des temps d'elution identiques 
mesures par CLHP en utilisant une colonne de particules non poreuses de poly(styrene-divinylbenzene)-C l8 
de 2.3 u.m et un tampon aqueux cornprenant de I'acetate de triethylammonium a 100 mM. 

48. Kit de genotypage, cornprenant une pluralite de paires d'amorces d'amplification, dans laquelle au moins une des 
amorces repond a la formule : 



T -L-MOI 

dans laquelle, 

T ms represente un groupe organique detectable par spectrometrie de masse, cornprenant du carbone, au 
moins un des elements consistant en hydrogene et fluor, et des atomes facultatifs choisis entre I'oxygene, 
I'azote, le soufre, le phosphore et I'iode ; 

L represente un groupe organique qui permet a un groupement particulier contenant T ms d'etre clive du reste 
du compose, le groupement contenant T ms cornprenant un groupe fonctionnel qui entretient un etat de charge 
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ionise* unique lorsque la compose" est soumis a une spectromdtrie de masse et est cholsi entre une amine 
tertiaire, une amine quaternaire et un acide organique ; et 

MO! reprSsente un fragment d'acide nucle'ique ; et chaque paire d'amorces s'associe a un locus different. 

49. Kit suivant la revendication 48, dans lequel la plurality correspond a une valeur d'au moins 3. 

50. Kit suivant la revendication 48, dans lequel la plurality correspond k une valeur d'au moins 5. 
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